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TITLE OF THE INVENTION 

PRODUCTION OF PHYTATE DEGRADING ENZYMES 
EN TRICHODERMA 



BACKGROUND OF THE INVENTION 

5 The mesophilic filamentous fungus Trichoderma reesei is very efficient 

in secreting cellulase enzymes into the growth medium. In optimized 
cultivation conditions amounts, up to 40 g/I of extracellular cellulase have 
been reported (Durand et aL. Enzyme Microb. Technol. 70:341-346 (1988): 
Durand et aL, in Biochemistry and Generics of Cellulose- Degradation. 

10 Academic Press, 1988, pp. 135-151). 

Development of transformation systems for T. reesei (Knowles et aL. 
EP244,234: PenttilSe/ aL, Gene 61: 155-164 (1987): Berka et aL, EP215.594) 
has made possible the application of genetic engineering methods to the 
fungus. By genetic engineering, production profiles of different cellulase 

15 enzymes have been modulated e.g., to give strains with improved levels of the 
endoglucanase I enzyme. The strong cbh\ promoter has been applied to 
promote endoglucanase expression (Nevalainen ei aL . "The molecular biology 
of Trichoderma and its application to the expression of both homologous and 
heterologous genes." in Molecular Industrial Mycology, Leong and Berka. 

20 eds.. Marcel Dekker Inc., New York, pp. 129-148 (1991): and Harkki. A. 
et aL. Enzyme Microb. TechnoL 7J:227-233 (1991)). 

In addition to tailoring the production profiles of homologous proteins, 
the production potential of T. reesei has been harnessed to express various 
heterologous proteins in the fungus. So far examples are few and include 

25 e.g., calf chymosin (Knowles et aL. EP244.234: Berka etaL. EP215.594: 
Harkki. A. et aL. Bio/TechnoL 7:596-603 (1989): Uusitalo. J.M. et aL. J. 
BiotechnoL 77:35-50 (1991)). CBHl-Fab fusion antibodies raised against 2- 
phenyl-oxazolone (Nyyssonen et aL. WO92/01797) and a fungal ligninolytic 
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enzyme (Saloheimo. M. and Niku-Paavola. M.-L. Bio/Technol. 9:987-990 
(1991)). For improved expression the desired gene has been inserted into a 
cbh\ expression cassette and introduced into T. reesei by protoplast 
transformation (Harkki. A. et al.. Bio/Technol. 7:596-603 (1989): Nyyssonen 
et al.. WO92/01797: Saloheimo. M. and NMku-Paavola. M.-L. Bio/Technol. 
9:987-990 (1991)). Even though heterologous filamentous fungal promoters 
such as Aspergillus amdS. argB and glucoamylase (GA) can function in T. 
reesei at least to some extent (Penttila et al., Gene 67:155-164 (1987): 
Knowles et al.. EP244.234) efficient expression requires the use of a 
homologous promoter. In addition, better yields have been obtained in some 
cases by producing the desired gene product as a fusion protein (Harkki. A. 
et al., Bio/Technol. 7:596-603 (1989); Nyyssonen et dir. WO92/01797). The 
yields of heterologous proteins obtained from T. reesei have varied between 
10 - 150 mg/1. 

15 Phytate, a storage form of phosphorus in plant seeds, is pan of human 

and animal diets. Phytate phosphorus is poorly available to monogastrics, 
because it forms complexes with multivalent metal ions and binds to proteins. 
Thus degradation of phytate is of interest. Plant phytin degrading enzymes 
phytase and acid phosphatase for the conversion of phytate to inositol and 

20 inorganic phosphorus are produced e.g., by bacteria (Powar, V.K. and 
Jagannathan. W.L.J. Bacteriol. 75:1102-1108 (1982); Cosgrove, D.J., Aust. 
J. Biol. Sci. 25:1207-1220 (1970) and Cosgrove. D.J. et al., Aust. J. Biol. 
Sci. 25:339-343 (1970): yeasts (Nayini, N.R. and Markakis, P.. Lebensmittel 
Wissenschaft und Technologic 77:24-26 (1984)) and filamentous fungi 

25 comprising several Aspergillus species such as A. terreus (Yamada et al.. 
Agric. Biol. Chem. 52:1275-1282 (1968). A. ficuum (Gibson. D.M. 
Biotechnol. Lett. 9:305-310 (1987) and A. /i/g<?r (Shieh. T.R. and Ware. J.H.. 
Appl. Microbiol. 76:1348-1351 (1968)). For complete degradation of plant 
phytin. both phytase and pH 2.5 acid phosphatase are needed. 

30 Industrial applications involve remarkable higher production yields than 

the amounts produced by the natural reported strains. The gene coding for 



phytase has been recently isolated and characterized from A. ficuum (Van 
Gorcom et al. . EP420.358 or WO91/05053) and the production of phytase has 
been improved in A. niger by multiplying the copy number of the gene in an 
expression cassette containing a strong homologous Aspergillus promoter e.g., 
GA (Van Gorcom et al.. EP420.358 or W09 1/05053). A gene coding for 
acid phosphatase has been isolated and characterized from A. niger (MacRae 
etaL. Gene 77:339-348 (1988)). 

NUMMARY OF THE INVENTION 

Recognizing the need for better production methods of phytase and 
pH2.5 acid phosphatase, and for compositions containing the same, the 
inventors have developed highly efficient methods for the recombinant 
production thereof. 

According to the invention, there is first provided a method for 
overexpressing phytate degrading enzymes in Trichoderma. 

There are further provided methods for overexpressing recombinant 
Aspergillus niger phytase and pH2.5 acid phosphatase enzymes in Trichoderma 
and secreting such enzymes therefrom. 

There are further provided expression vectors containing genetic se- 
quences encoding such enzymes, and Trichoderma host cells transformed with 
such expression vectors. 

There are further provided compositions comprising one or more of the 
Trichoderma-sy nihesiztd. recombinant phytase-degrading enzymes of the 
invention. 

There are further provided methods for the use of such compositions 
in feed and other such methods comprising food compositions, especially for 
animals. 
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BRIEF DESCRIPT ION OF THF DRAWINGS 

Figure 1. Sequence of peptide #816 ([SEQ ID No. :57:]), oligo PHY- 
31 (fSEQ ID No. :64:1), peptide #1110 ([SEQ ID No. :62:]), oligo PHY-34 
([SEQ ID No. :65:]) and oligo PHY-35 ([SEQ ID No. :52:1). P H2.5 acid 
phosphatase oligonucleotide PHY-31 is a 17mer mixture with 64 fold 
degeneracy and a single inosine. Peptide #816 is derived from an 
endoproteinase Lys-C digestion of purified native acid phosphatase. PHY-34 
is a 17mer mixture with 128 fold degeneracy. PHY-35 is a 17mer mixture 
with 64 fold degeneracy. Both PHY-34 and PHY-35 are necessary for 
complete representation of Peptide #1 1 10. Peptide #1 1 10 is derived from a 
trypsin digestion of purified native acid phosphatase. 

Figure 2. Nucleotide sequence from the 2.1 kb Sphl fragment 
containing the pH 2.5 acid phosphatase gene [SEQ ID No. :1:] with deduced 
amino acid translation [SEQ ID No. :2:]. The intron donorr lariat and 
acceptor sequence as determined by cDNA sequencing are overlined. The 
nucleotide sequence corresponding to peptides #816 ([SEQ ID No. :57:]) and 
#1110 ([SEQ ID No. :62:]) is underlined. The genomic nucleotide sequence 
was determined by the M13-dideoxy method (Sanger, F., etal., Proc. Natl. 
Acad. ScL USA 74:5463-5467 (1977) with the use of the United States 
20 Biochemical Sequencase II kit. 

Figure 3. The amino acid sequences of the phytase trvptic peptides 
#792 [SEQ ID No. :43:] and #420 [SEQ ID No. :23:] and the deduced 
oligonucleotides [SEQ ID Nos. :3:, :4:, :5: and :6:[ used in the production of 
the phytase probe by nested PCR amplification. 

Figure 4. Plasmid pALK169. The map of pALK169 containing the 2.4 
kb Sphl insen and showing the restriction map of the insert. The location of 
the phytase gene is shown by an arrow. The hybridization site for the 350 bp 
PCR fragment in the phytase sequence is shown by an intensified line. 

Figure 5. The nucleotide sequence of the phytase gene. [SEQ ID Nos. 
30 :7: (DNA) and :8: (amino acid)!. 



15 



25 
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Figure 6. The PCR primers used for making the cbh 1 -phytase fusion 
fragments fSEQ ID Nos. :9: and :10: and :11: and :12:|. 

Figure 7. Construction of pALK171. The phytase gene with its own 
signal sequence was fused to the cbh\ promoter. Only the relevant restriction 
5 sites are shown. 

Figure 8. Construction of pALK172. The phytase gene was fused to 
the cbh\ signal sequence. Only the relevant restriction sites are shown. 

Figure .9. Plasmids pALX173A and pALK173B. The maps of the 
plasmids containing the phytase gene with its own promoter and the selection 
10 marker, amdS gene, are shown. In the plasmid, pALK173A the transcriptional 
orientation of the phytase and arndS genes is the same; and in the plasmid 
pALK173B. the transcriptional orientation of these two genesis opposite to 
each other. 

Figure 10. Western blots of the samples from the culture supernatants 

15 of the Trichoderma host strains and transformants producing phytase. Lane 1: 
50 ng of purified Aspergillus ALKO 243 phytase; Lane 2; 15 ng of endoF- 
treated Aspergillus ALKO 243 phytase; Lanes 3 and 10: T. reesei ALKO 233; 
Lanes 4-5 and 1 1-12: T. reesei ALKO 233 transformant 171FR/ A4 and A 13. 
respectively; Lanes 6 and 13: T. reesei ALKO 2221; Lanes 7-8 and 14-15: 

20 T. reesei ALKO 2221 transformant 171FR/A5 and A9. respectively; Lane 9: 
T. reesei ALKO 2221 transformant D2; Lane 16: T. reesei ATCC56765: 
Lanes 17. 18. 19: T. reesei ATCC56765 transformants 171FR/A2L All. and 
A23. respectively. In each case 2 /xl of 1:10 dilution of the culture supernatant 
were run in the gel. 171 FR: the host transformed with the Xbal fragment 

25 from the plasmid pALK171. 

Figure 11. The PCR primers used for making the cbh\ - pH 2.5 acid 
phosphatase fusion fragments [SEQ ID Nos. :13: and :14: and :15:]. 

Figure 12. Construction of the plasmid pALK533. The pH 2.5 acid 
phosphatase gene with its own signal sequence was fused to the cbh\ 

30 promoter. 
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Figure 13. Construction of the plasmid pALK532. The pH 2.5 acid 
phosphatase gene was fused to the cbh\ signal sequence and promoter. 

Figure 14. Western blot of the Trichoderma transformants producing 
pH 2.5 acid phosphatase. Lane 1 : lOng of purified Aspergillus ALKO 243 pH 
5 2.5 acid phosphatase: Lane 2: lOng of endoF treated Aspergillus ALKO 243 
pH 2.5 acid phosphatase; and Lanes 3-9: 60ng of protein from the each of the 
culture supernatants of Trichoderma reesei ALKO 2221 transformants SC-9. 
KA-31, KA-17. KB-44. KB- 18, SB-4 and KA-28, respectively. 

DETAILED DESCRIPTION OF THE INVENTION 
10 I. DEFINITIONS 

In the description that follows, a number of terms used in recombinant 
DNA (rDNA) technology are extensively utilized. In order to provide a clear 
and consistent understanding of the specification and claims, including the 
scope to be given such terms, the following definitions are provided. 

15 Gene. A DNA sequence containing a template for an RNA 

polymerase. The RNA transcribed from a gene may or may not code for a 
protein. RNA that codes for a protein is termed messenger RNA (mRNA) and. 
in eukaryotes, is transcribed by RNA polymerase II. A gene containing a 
RNA polymerase II template (as a result of a RNA polymerase II promoter) 

20 wherein an RNA sequence is transcribed which has a sequence complementary 
to that of a specific mRNA, but is not normally translated may also be 
constructed. Such a gene construct is herein termed an "antisense RNA gene" 
and such an RNA transcript is termed an "antisense RNA." Antisense RNAs 
are not normally translatable due to the presence of translational stop codons 

25 in the antisense RNA sequence. 

A "complementary DNA" or "cDNA" gene includes recombinant genes 
synthesized by. for example, reverse transcription of mRNA. thus lacking 
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intervening sequences (introns). Genes clones from genomic DNA may or may 
not contain introns. 

Cloning vehicle . A plasmid or phage DNA or other DNA sequence 
which is able to carry genetic information, specifically DNA. into a host cell. 

5 A cloning vehicle is often characterized by one or a small number of 
endonuclease recognition sites at which such DNA sequences may be cut in 
a determinable fashion without loss of an essential biological function of the 
vehicle, and into which a desired DNA may be spliced in order to bring about 
its cloning into the host cell. The cloning vehicle may further contain a 

10 marker suitable for use in the identification of cells transformed with the 
cloning vehicle, and origins of replication that allow for the maintenance and 
replication of the vehicle in one or more prokaryotic or eukaryotic hosts. 
Markers, for example, are tetracycline resistance or ampicillin resistance. The 
word "vector" is sometimes used for "cloning vehicle." A "plasmid" is a 

15 cloning vehicle, generally circular DNA, that is maintained and replicates 
autonomously in at least one host cell. 

Expression vehicle . A vehicle or vector similar to a cloning vehicle 
but which supports expression of a gene that has been cloned into it, after 
transformation into a host. The cloned gene is usually placed under the 

20 control of (i.e. . operably linked to) certain control sequences such as promoter 
sequences, that may be provided by the vehicle or by the recombinant 
construction of the cloned gene. Expression control sequences will vary 
depending on whether the vector is designed to express the operably linked 
gene in a prokaryotic or eukaryotic host and may additionally contain 

25 transcriptional elements such as enhancer elements (upstream activation 
sequences) and termination sequences, and/or translational initiation and 
termination sites. 

Host . A host is a cell, prokaryotic or eukaryotic. that is utilized as the 
recipient and carrier of recombinant material. 
30 Eukarvotic host . A "eukaryotic host" may be any cell from a 

eukarvotic organism, including, for example, animal, plant, fungi and yeast. 
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Host of the Invention . The "host of the invention" is a filamentous 
tungus host that has been engineering to produce recombinant phytase and/or 
pH 2.5 acid phosphatase according to the methods of the invention. 

Functional Derivative . A "functional derivative" of a protein or 
5 nucleic acid, is a molecule that has been chemically or biochemically derived 
from (obtained from) such protein or nucleic acid and which retains a 
biological activity (either functional or structural) that is a characteristic of the 
native protein or nucleic acid. A "mutant" of a protein or nucleic acid is a 
biochemical or chemical derivative of such protein or nucleic acid. The term 
10 "functional derivative" is intended to include "mutants," "fragments." 
"variants," "analogues," or "chemical derivatives" of a molecule that retain a 
desired activity of the native molecule. "~ 

As used herein, a molecule is said to be a "chemical derivative" of 
another molecule when it contains additional chemical moieties not normally 
15 a part of the molecule. Such moieties may improve the molecule^ solubility, 
absorption, biological half life, etc. The moieties may decrease the toxicity 
of the molecule, or eliminate or attenuate any undesirable side effect of the 
molecule, etc. Moieties capable of mediating such effects are disclosed in 
Remington's Pharmaceutical Sciences (1980). Procedures for coupling such 
20 moieties to a molecule are well known in the an. 

Fragment . A "fragment" of a molecule such as a protein or nucleic 
acid is meant to refer to a portion of the native amino acid or nucleotide 
genetic sequence, and in particular the functional derivatives of the invention. 

Variant or Analog . A "variant" or "analog" of a protein or nucleic 
25 acid is meant to refer to a molecule substantially similar in structure and 
biological activity to either the native molecule, such as that encoded by a 
functional allele. 
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II. THE HOSTS OF THE INVENTION 

T. reesei does not produce endogenous phytase. Instead, other enzyme 
components such as /3-glucan degrading activity, important in e.g. feed 
applications, are produced in high amounts. Thus the use of T. reesei as a 

5 production host for fungal phytase and pH 2.5 acid phosphatase results in 
secretion of a totally different enzyme composition when compared to that 
secreted from Aspergillus. In addition, by using Trichoderma as a source of 
a composition containing phytate degrading enzymes, some difficult problems 
in downstream processing that occur with similar Aspergillus compositions 

10 (e.g., in filtration) are avoided. This is because the mode of growth of the 
recombinant T. reesei is different than that of Aspergilli. the mycelium being 
most often fluid and easily separable. Thus, by producing these enzymes in 
the hosts of the invention, no problems in subsequent filtration of the secreted 
material is seen, as is the case with the often slimy and thick mycelium of 

15 Aspergilli. 

Improved amounts of phytase and pH 2.5 acid phosphatase (as 
compared to synthesis in Aspergillus) can be produced in the T. reesei 
expression system by inserting a DNA sequence obtained from A. niger. 
coding for phytase or pH 2.5 acid phosphatase activity, into a T. reesei 
20 expression cassette containing the cbhl promoter and the Aspergillus amdS 
gene as a transformation marker. Transformation of the construct to T. reesei 
hosts results in stable transformants expressing the phytase or pH 2.5 acid 
phosphatase in high amounts in a novel background of accompanying enzyme 
activities. 

25 The mixture produced by T. reesei contains high 0-glucanase activity 

and low glucoamylase activity. Moreover, the amount of phytase produced 
by recombinant T. reesei strains in shake flask cultivations is comparable to 
the level of which the main cellulase. the endogenous cellobiohydrolase I. is 
expressed, more than 1 g/L The amount of the pH 2.5 acid phosphatase 
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produced by ihe recombinant strains in shake flask cultivations is less than 0.5 



2/1. 



Aspergillus niger var. awamori ALKO 243 (ATCC 38854) (IFO4033) 
phytase and acid phosphatase (optimum P H 2.5) were overexpressed in 
Trichoderma reesei under the control of the Trichoderma cellobiohydrolase 1 
(cbh\) promoter. In addition, the phytase gene was expressed from its own 
promoter. 

For both the genes, two constructions utilizing the cbhl promoter were 
made: in one construction the phytase or acid phosphatase signal sequence was 
used and in the other construction the cbhl signal sequence was used. In all 
cases, the fusions were made precise by using PCR and the plasmids were 
constructed so that the expression cassette could be seplrated from the vector 
backbone prior to transformations. Thus it was possible to transform strains 
with only the desired sequences (and not the entire vector used for maintaining 
the sequences) and thus to obtain strains that did not contain any "foreign- 
sequences; such strains were suitable for industrial purposes. 

Three Trichoderma reesei strains, ATCC 56765 (RutC-30), ALKO 233 
(VTT-D-79125) and a low aspartyl protease producing strain ALKO 2221 
were used as hosts for phytase expression. For acid phosphatase expression, 
only T. reesei ALKO 2221 was transformed. When phytase was expressed 
under the cbh\ promoter in Trichoderma, the best transformation with no 
E. coli sequences produced in shake flask cultivations about 3.600 fold more 
phytase than the nontransformed A. niger ALKO 243. When the phytase 
promoter was used, the best yield obtained in shake flask cultivations of T. 
reesei transformants was about 120 fold that obtained with A. niger ALKO 
243. The best acid phosphatase activities obtained were about 240 fold higher 
compared to the levels produced by the A. niger ALKO 243 strain. 

The molecular weights (in SDS-PAGE) of the phytase and pH 2.5 acid 
phosnhatase secreted by Trichoderma were different from those secreted by 
Aspergillus. The difference seemed to be due to different glycosylate. 
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The production level of phyiase obtained when the Aspergillus gene 
was expressed in Trichoderma under the control of a Trichoderma promoter 
was surprisingly high. 

The use of 7. reesei as a production host for fungal phytase and pH 2.5 

5 acid phosphatase results in totally different enzyme preparations as compared 
to that from Aspergillus. When compared to Aspergillus preparations, the 
mixtures produced by 7. reesei contain substantially higher )3-glucanase and 
proportionally .lower glucoamylase activities thus making 7. reesei preparations 
preferable to be used e.g. in animal feed. 

10 The hosts of the invention are meant to include all Trichoderma. 

Trichoderma are classified on the basis of morphological evidence of 
similarity. 7. reesei was formerly known as T viride Pers. or T koningii 
Oudem; sometimes it was classified as a distinct species of the T. 
longibrachiatum group. The entire genus Trichoderma, in general, is 

15 characterized by rapidly growing colonies bearing tufted or pustulate, 
repeatedly branched conidiophores with lageniform phialides and hyaline or 
green conidia borne in slimy heads (Bissett. J., Can. J. Bot. 62:924-931 
(1984)). 

The fungus called 7. reesei is clearly defined as a genetic family 
20 originating from the strain QM6a, that is, a family of strains possessing a 
common genetic background originating from a single nucleus of the particular 
isolate QM6a. Only those strains are called 7. reesei. 

Classification by morphological means is problematic and the first 
recently published molecular data from DNA-fmgerprint analysis and the 
25 hybridization pattern of the cellobiohydrolase 2 (cbh2) gene in 7. reesei and 
7. longibrachiatum clearly indicates a differentiation of these strains (Meyer. 
W. et aL. Curr. Genet. 27:27-30 (1992); Morawetz. R. et aL. Curr. Genet. 
27:31-36 (1992)). 

However, there is evidence of similarity between different Trichoderma 
30 species at the molecular level that is found in the conservation of nucleic acid 
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and amino acid sequences of macromolecular entities shared by the various 
Trichoderma species. For example. Cheng, C. et al., Nucl. Acids. Res. 
75:5559 (1990), discloses the nucleotide sequence of T. viride cbhl. The gene 
was isolated using a probe based on the T. reesei sequence. The authors note 
that there is a 95% homology between the amino acid sequences of the T. 
viride and T. reesei gene. Goldman, G.H. et al., Nucl. Acids Res. 75:6717 
(1990), discloses the nucleotide sequence of phosphoglycerate kinases from T. 
viride and notes that the deduced amino acid sequence is 81 % homologous 
with the phosphoglycerate kinase gene from T. reesei. Thus, the species 
classified to T. viride and T. reesei must genetically be very close to each 



other 



In addition, there is a high similarity of transformation conditions 
among the Trichoderma. Although practically all the industrially important 
species of Trichoderma can be found in the formerly discussed Trichoderma 
section Longibrachiatum, there are some other species of TrichodeTma that are 
not assigned to this section. Such a species is. for example, Trichoderma 
harzianum, which acts as a biocontrol agent against plant pathogens. A 
transformation system has also been developed for this Trichoderma species 
(Herrera-Estrella. A. et al., Molec. Microbiol. 4:839-843 (1990)) that is 
essentially the same as that taught in the application. Thus, even though 
Trichoderma harzianum is not assigned to the section Longibrachiatum. the 
method used by Herrera-Estrella in the preparation of spheroplasts before 
transformation is the same. The teachings of Herrera-Estrella show that there 
is not a significant diversity of Trichoderma spp. such that the transformation 
system of the invention would not be expected to function in all Trichoderma. 

Further, there is a common functionality of fungal transcriptional 
control signals among fungal species. At least three A. nidulans promoter 
sequences. amdS. argB. and gpd. have been shown to give rise to gene 
expression in T. reesei. For amdS and argB. only one or two copies of the 
gene are sufficient to being about a selectable phenotypes (Penttila et al. . Gene 
67:155-164 (1987). Gruber. F. et al.. Curr. Generic 75:71-76 (1990) also 
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notes that that fungal genes can often be successfully expressed across different 
species. 

Many species of Trichoderma are available from a wide variety of 
resource centers that contain fungal culture collections. In addition. 
5 . Trichoderma species are catalogued in various databases. These resources and 
databases are summerized by O'Donnell, K. et aL. in Biochemistry of 
Filamentous Fungi: Technology and Products, D.B. Finkelstein et aL, eds.. 
Butterworth-Heinemann. Stoneham. MA. USA. 1992. pp. 3-39. 



III. CONSTRUCTION OF THE HOSTS OF THE INVENTION 

10 The process for genetically engineering the hosts of the invention, 

according to the invention, is facilitated through the isolation and partial 
sequencing of pure protein encoding an enzyme of interest or by the cloning 
of genetic sequences which are capable of encoding such protein with 
polymerase chain reaction technologies; and through the expression of such 

15 genetic sequences. As used herein, the term "genetic sequences" is intended 
to refer to a nucleic acid molecule (preferably DNA). Genetic sequences 
which are capable of encoding a protein are derived from a variety of sources. 
These sources include genomic DNA, cDNA, synthetic DNA. and 
combinations thereof. The preferred source of genomic DNA is a fungal 

20 genomic library. The preferred source of the cDNA is a cDNA library 
prepared from fungal mRNA grown in conditions known to induce expression 
of the desired mRNA or protein. 

The genomic DNA of the invention may or may not include naturally 
occurring introns. Moreover, such genomic DNA may be obtained in 

25 association with the 5' promoter region of the gene sequences and/or with the 
3' transcriptional termination region. Further, such genomic DNA may be 
obtained in association with the genetic sequences which encode the 5' non- 
translated region of the mRNA and/or with the genetic sequences which 
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encode the 3' non-translated region. To the extent that a host cell can 
recognize the transcriptional and/or translational regulatory signals associated 
with the expression of the mRNA and protein, then the 5' and/or 3' non- 
transcribed regions of the native gene, and/or. the 5' and/or 3' non-translated 
regions of the mRNA may be retained and employed for transcriptional and 
translational regulation. Genomic DNA can be extracted and purified from 
any host cell, especially a fungal host cell, which naturally expresses the 
desired protein by means well known in the an. 

For cloning into a vector, such suitable DNA preparations (either 
genomic DNA or cDNA) are randomly sheared or enzymatically cleaved, 
respectively, and ligated into appropriate vectors to form a recombinant gene 
(either genomic or cDNA) library. ^ 

A DNA sequence encoding a desired protein or its functional 
derivatives may be inserted into a DNA vector in accordance with 
conventional techniques, including blunt-ending or staggered-ending termini 
for ligation, restriction enzyme digestion to provide appropriate termini, filling 
in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and ligation with appropriate ligases. Techniques for such 
manipulations are disclosed by Maniatis, T., (Maniatis, T. et al. % Molecular 
Cloning (A Laboratory Manual), Cold Spring Harbor Laboratory, second 
edition. 1988) and are well known in the an. 

Libraries containing sequences coding for the desired gene may be 
screened and the desired gene sequence identified by any means which 
specifically selects for a sequence coding for such gene or protein such as. for 
example, a) by hybridization with an appropriate nucleic acid probe(s) 
containing a sequence specific for the DNA of this protein, or b) by 
hybridization-selected translational analysis in which native mRNA which 
hybridizes to the clone in question is translated in vitro and the translation 
products are further characterized, or. c) if the cloned genetic sequences are 
themselves capable of expressing mRNA, by immunoprecipitation of a 
translated protein product produced by the host containing the clone. 
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Oligonucleotide probes specific for a certain protein which can be used 
to identify clones to this protein can be designed from the knowledge of the 
amino acid sequence of the protein or from the knowledge of the nucleic acid 
sequence of the DNA encoding such protein or a related protein. 
5 Alternatively, antibodies may be raised against purified forms of the protein 
and used to identify the presence of unique protein determinants in 
transformants that express the desired cloned protein. The sequence of amino 
acid residues in a peptide is designated herein either through the use of their 
commonly employed three-letter designations or by their single-letter 

10 designations. A listing of these three-letter and one-letter designations may be 
found in textbooks such as Biochemistry, Lehninger, A., Worth Publishers, 
New York, NY (1970). When the amino acid sequence is listedJiorizontally, 
unless otherwise stated, the amino terminus is intended to be on the left end 
and the carboxy terminus is intended to be at the right end. Similarly, unless 

15 otherwise stated or apparent from the context, a nucleic acid sequence is 
presented with the 5' end on the left. 

Because the genetic code is degenerate, more than one codon may be 
used to encode a particular amino acid (Watson, J.D., In: Molecular Biology 
of the Gene, 3rd Ed., W.A. Benjamin, Inc.. Menlo Park, CA (1977), pp. 356- 

20 357). The peptide fragments are analyzed to identify sequences of amino acids 
which may be encoded by oligonucleotides having the lowest degree of 
degeneracy. This is preferably accomplished by identifying sequences that 
contain amino acids which are encoded by only a single codon. 

Although occasionally an amino acid sequence may be encoded by only 

25 a single oligonucleotide sequence, frequently the amino acid sequence may be 
encoded by any of a set of similar oligonucleotides. Importantly, whereas all 
of the members of this set contain oligonucleotide sequences wh' .:h are capable 
of encoding the same peptide fragment and. thus, potentially contain the same 
oligonucleotide sequence as the gene which encodes the peptide fragment, only 

30 one member of the set contains the nucleotide sequence that is identical to the 
exon coding sequence of the gene. Because this member is present within the 
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set. and is capable of hybridizing to DNA even in the presence of the other 
members of the set. it is possible to employ the unfractionated set of 
oligonucleotides in the same manner in which one would employ a single 
oligonucleotide to clone the gene that encodes the peptide. 
5 Using the genetic code, one or more different oligonucleotides can be 

identified from the amino acid sequence, each of which would be capable of 
encoding the desired protein. The probability that a particular oligonucleotide 
will, in fact, constitute the actual protein encoding sequence can be estimated 
by considering abnormal base pairing relationships and the frequency with 
10 which a particular codon is actually used (to encode a particular amino acid) 
in eukaryotic cells. Using "codon usage rules," a single oligonucleotide 
sequence, or a set of oligonucleotide sequences, that "contain a theoretical 
"most probable" nucleotide sequence capable of encoding the protein 
sequences is identified. 
15 The suitable oligonucleotide, or set of oligonucleotides, which is 

capable of encoding a fragment of a certain gene (or which is complementary 
to such an oligonucleotide, or set of oligonucleotides) may be synthesized by 
means well known in the art (see, for example. Synthesis and Application of 
DNA and RNA, S.A. Narang, ed., 1987, Academic Press, San Diego, CA) 
20 and employed as a probe to identify and isolate a clone to such gene by 
techniques known in the an. Techniques of nucleic acid hybridization and 
clone identification are disclosed by Maniatis. T., et al., in: Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratories. Cold Spring 
Harbor, NY (1982)), and by Hames. B.D., et al., in: Nucleic Acid 
25 Hybridization, A Practical Approach. IRL Press, Washington. DC (1985)). 
Those members of the above-described gene library which are found to be 
capable of such hybridization are then analyzed to determine the extent and 
nature of coding sequences which they contain. 

To facilitate the detection of a desired DNA coding sequence, the 
30 above-described DNA probe is labeled with a detectable group. Such 
detectable group can be any material having a detectable physical or chemical 



property. Such materials have been well-developed in the field of nucleic acid 
hybridization and in general most any label useful in such methods can be 
applied to the present invention. Particularly useful are radioactive labels, 
such as 3 -P. 3 H. U C. "S. i:5 I, or the like. Any radioactive label may be 
employed which provides for an adequate signal and has a sufficient half-life. 
If single stranded, the oligonucleotide may be radioactively labelled using 
kinase reactions. Alternatively, polynucleotides are also useful as nucleic acid 
hybridization probes when labeled with a non-radioactive marker such as 
biotin, an enzyme or a fluorescent group. 

Thus, in summary, the elucidation of a partial protein sequence, 
permits the identification of a theoretical "most probable" DNA sequence, or 
a set of such sequences, capable of encoding such a peptide. By-constructing 
an oligonucleotide complementary to this theoretical sequence (or by 
constructing a set of oligonucleotides complementary to the set of "most 
probable" oligonucleotides), one obtains a DNA molecule (or set of DNA 
molecules), capable of functioning as a probe(s) for the identification and 
isolation of clones containing a gene. 

In an alternative way of cloning a gene, a library is prepared using an 
expression vector, by cloning DNA or, more preferably cDNA prepared from 
a cell capable of expressing the protein into an expression vector. The library 
is then screened for members which express the desired protein, for example, 
by screening the library with antibodies to the protein. 

The above discussed methods are, therefore, capable of identifying 
genetic sequences which are capable of encoding a protein or biologically 
active or antigenic fragments of this protein. In order to further characterize 
such genetic sequences, and. in order to produce the recombinant protein, it 
is desirable to express the proteins which these sequences encode. Such 
expression identifies those clones which express proteins possessing charac- 
teristics of the desired protein. Such characteristics may include the ability to 
specifically bind antibody, the ability to elicit the production of antibody which 
are capable of binding to the native, non- recombinant protein, the ability to 
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provide a enzymatic activity to a cell that is a property of the protein, and the 
ability to provide a non-enzymatic (but specific) function to a recipient cell, 
among others. 

A DNA sequence may be shortened by means known in the an to 
isolate a desired gene from a chromosomal region that contains more 
information than necessary for the utilization of this gene in the hosts of the 
invention. For example, restriction digestion may be utilized to cleave the full- 
length sequence at a desired location. Alternatively, or in addition, nucleases 
that cleave from the 3'-end of a DNA molecule may be used to digest a certain 
sequence to a shortened form, the desired length then being identified and 
purified by gel electrophoresis and DNA sequencing. Such nucleases include, 
for example, Exonuclease III and fla/31. Other nucleases are well known in 
the art. 

If the coding sequence and an operably linked promoter are introduced 
into a recipient eukaryotic cell as a non-replicating DNA (or RNA), non- 
integrating molecule, the expression of the encoded protein may occur through 
the transient (nonstable) expression of the introduced sequence. 

Preferably the coding sequence is introduced on a DNA (or RNA) 
molecule, such as a closed covalent circular molecule that is incapable of 
autonomous replication, or preferable a linear molecule that integrates into the 
host chromosome. Genetically stable transformants may be constructed with 
vector systems, or transformation systems, whereby a desired DNA is 
integrated into the host chromosome. Such integration may occur de novo 
within the cell or. be assisted by transformation with a vector which 
functionally inserts itself into the host chromosome, for example, transposons 
or other DNA elements which promote integration of DNA sequences in 
chromosomes. A vector is employed which is capable of integrating the 
desired gene sequences into a fungal host cell chromosome. 

The genes coding for phytase or pH 2.5 acid phosphatase under the 
control of suitable promoters may be combined in one plasmid construction 
and introduced into the host cells by transformation. The nature of the 
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piasmid vector will depend on the host organism. In the practical realization 
of the invention the filamentous fungus Trichoderma has been employed as a 
model. Thus, for Trichoderma and especially for T. reesei. vectors 
incorporating DNA that provides for integration of the sequences encoding the 
phytase or pH 2.5 acid phosphatase genes into the host's chromosome are 
preferred. Such targeting to, for example, the cbhl locus may be achieved by 
providing cbhl coding or flanking sequences on the recombinant construct, in 
an amount sufficient to direct integration to this locus at a relevant frequency. 

Cells which have stably integrated the introduced DNA into their 
chromosomes are selected by also introducing one or more markers which 
allow for selection of host cells which contain the expression vector in the 
chromosome, for example the marker may provide biocide resistance, e.g., 
resistance to antibiotics, or heavy metals, such as copper, or the like. The 
selectable marker gene can either be directly linked to the DNA gene 
sequences to be expressed, or introduced into the same cell by co- 
transfection. A genetic marker especially for the transformation of the hosts 
of the invention is amdS* encoding acetamidase and thus enabling Trichoderma 
to grow on acetamide as the onh nitrogen source. 

To express a desired protein and/or its active derivatives, 
transcriptional and translation^ signals recognizable by an appropriate host are 
necessary. The cloned coding sequences, obtained through the methods 
described above, and preferably in a double-stranded form, may be operably 
linked to sequences controlling transcriptional expression in an expression 
vector, and introduced into a host cell, either prokaryote or eukaryote, to 
produce recombinant protein or a functional derivative thereof. Depending 
upon which strand of the coding sequence is operably linked to the sequences 
controlling transcriptional expression, it is also possible to express antisense 
RNA or a functional derivative thereof. 

Expression of the protein in different hosts may result in different post- 
radiational modifications which may alter the properties of the protein. 
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Preferably, the present invention encompasses the expression of the protein or 
a functional derivative thereof, in eukaryotic cells, and especially in fungus. 

A nucleic acid molecule, such as DNA. is said to be "capable of 
expressing" a polypeptide if it contains expression control sequences which 
contain transcriptional regulatory information and such sequences are 
"operably linked" to the nucleotide sequence which encodes the polypeptide. 

An operable linkage is a linkage in which a sequence is connected to 
a regulatory sequence (or sequences) in such a way as to place expression of 
the sequence under the influence or control of the regulatory sequence. Two 
DNA sequences (such as a coding sequence and a promoter region sequence 
linked to the 5' end of the coding sequence) are said to be operably linked if 
induction of promoter function results in the transcription of mRNA encodino 
the desired protein and if the nature of the linkage between the two DNA 
sequences does not (1) result in the introduction of a frame-shift mutation, 
(2) interfere with the ability of the expression regulatory sequences to direct 
the expression of the protein, antisense RNA, or (3) interfere with the ability 
of the DNA template to be transcribed. Thus, a promoter region would be 
operably linked to a DNA sequence if the promoter was capable of effecting 
transcription of that DNA sequence. 

The precise nature of the regulatory regions needed for gene expression 
may vary between species or cell types, but shall in general include, as 
necessary, 5' non-transcribing and 5' non-translating (non-coding) sequences 
involved with initiation of transcription and translation respectively, such as 
the TATA box. capping sequence, CAAT sequence, and the like. Especially, 
such 5' non-transcribing control sequences will include a region which 
contains a promoter for transcriptional control of the operably linked gene. 
Such transcriptional control sequences may also include enhancer sequences 
or upstream activator sequences, as desired. 

Expression of a protein in eukaryotic hosts such as fungus requires the 
use of regulatory regions functional in such hosts, and preferably fungal 
regulatory systems. A wide variety of transcriptional and translational regu- 
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iatory sequences can be employed, depending upon the nature of the host. 
Preferably, these regulatory signals are associated in their native state with a 
particular gene which is capable of a high level of expression in the host cell. 

In eukaryotes. where transcription is not linked to translation, such 
5 control regions may or may not provide an initiator methionine (AUG) codon. 
depending on whether the cloned sequence contains such a methionine. Such 
regions will, in general, include a promoter region sufficient to direct the 
initiation of RNA synthesis in the host cell. Promoters from filamentous 
fungal genes which encode a mRNA product capable of translation are 
10 preferred, and especially, strong promoters can be employed provided they 
also function as promoters in the host cell. Preferred strong eukaryotic pro- 
moters for use in Trichoderma include the T. reesei cbh\ gene promoter or a 
promoter of another cellulase gene such as that for the cbhl, egll or egll gene 
may be used. In addition to the use of Trichoderma regulatory elements, the 
1 5 expression of proteins may be placed under the control of regulatory elements 
from Aspergillus nidulans (for example, the argB gene promoter and the amdS 
gene promoter), Aspergillus niger (for example, the phytase promoter or the 
glucoamylase gene promoter) However, expression under non-Trichoderma 
regulatory elements such as these may be very low as compared to the use of 
20 Trichoderma elements, and especially those of T. reesei. 

As, is widely known, translation of eukaryotic mRNA is initiated at the 
codon which encodes the first methionine. For this reason, it is preferable to 
ensure that the linkage between a eukaryotic promoter and a DNA sequence 
which encodes the desired protein, or a functional derivative thereof, does not 
25 contain any intervening codons which are capable of encoding a methionine. 
The presence of such codons results either in a formation of a fusion protein 
(if the AUG codon is in the same reading frame as the protein-coding DNA 
sequence) or a frame-shift mutation (if the AUG codon is not in the same 
reading frame as the protein-coding sequence). 
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h may be desired to construct a fusion product that contains a partial 
coding sequence (usually at the amino terminal end) of a protein and a second 
coding sequence (partial or complete) of a phytase degrading enzyme of the 
invention. The sequence that does not encode the phytase degrading enzyme 
may or may not function as a signal sequence for secretion of the protein from 
the host cell. For example, the sequence coding for desired protein may be 
linked to a signal sequence which will allow secretion of the protein from, or 
the compartmentalization of the protein in.' a particular host. Such fusion 
protein sequences may be designed with or without specific protease sites such 
that a desired peptide sequence is amenable to subsequent removal. In a 
preferred embodiment, the native signal sequence of a fungal protein is used, 
or a functional derivative of that sequence that retains the "ability to direct the 
secretion of the peptide that is operably linked to it. Aspergillus 
leader/secretion signal elements also function in Trichoderma. 
15 Transcriptional initiation regulatory signals can be selected which allow 

for repression or activation, so that expression of the operably linked genes 
can be modulated. For example, regulatory signals may be temperature- 
sensitive so that by varying the temperature, expression can be repressed or 
initiated, or are subject to chemical regulation, e.g., metabolite. Translational 
20 signals are not necessary when it is desired to express antisense RNA 
sequences. 

If desired, the non-transcribed and/or non-translated regions 3' to the 
sequence coding for a desired protein can be obtained by the above-described 
cloning methods. The 3 '-non-transcribed region may be retained for its 
25 transcriptional termination regulatory sequence elements, or for those elements 
which direct polyadenylation in eukaryotic cells. Where the native expression 
control sequences signals do not function satisfactorily in a host cell, then 
sequences functional in the host cell may be substituted. 

The vectors of the invention may further comprise other operably 
30 linked regulatory elements such as DNA elements which confer antibiotic 
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resistance, or origins of replication for maintenance of the vector in one or 
more host cells. 

In another embodiment, especially for maintenance of the vectors of the 
invention in prokaryotic cells, or in yeast S. cerevisiae cells, the introduced 
5 sequence is incorporated into a plasmid or viral vector capable of autonomous 
replication in the recipient host. Any of a wide variety of vectors may be 
employed for this purpose. In Bacillus hosts, integration of the desired DNA 
may be necessary. 

Factors of importance in selecting a particular plasmid or viral vector 
10 include: the ease with which recipient cells that contain the vector may be 
recognized and selected from those recipient cells which do not contain the 
vector; the number of copies of the vector which are desired in" a particular 
host; and whether it is desirable to be able to "shuttle" the vector between host 
cells of different species. 
15 Preferred 5. cerevisiae yeast plasmids include those containing the 2- 

micron circle, etc., or their derivatives. Such plasmids are well known in the 
an (Botstein, D., et al., Miami Wntr. Symp. 79:265-274 (1982); Broach, J.R., 
in: The Molecular Biology of the Yeast Saccharomyces: Life Cycle and 
Inheritance, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, p. 
20 445-470 (1981); Broach. J.R., Cell 28:203-204 (1982); Bollon, D.P., et aL. 
J. Clin. HematoL Oncol. 70:39-48 (1980): Maniatis. T., In: Cell Biology: 
A Comprehensive Treatise, Vol. 3. Gene Expression. Academic Press, NY. 
pp. 563-608 (1980)), and are commercially available. 

Once the vector or DNA sequence containing the construct(s) is 
25 ' prepared for expression, the DNA construct(s) is introduced into an 
appropriate host cell by any of a variety of suitable means, including 
transformation. After the introduction of the vector, recipient cells are grown 
in a selective medium, which selects for the growth of vector-containing cells. 
Expression of the cloned gene sequence(s) results in the production of the 
30 desired protein, or in the production of a fragment of this protein. This 
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expression can take place in a continuous manner in the transformed cells or 
in a controlled manner, for example, by induction of expression. 

Fungal transformation is carried out also accordingly to techniques 
known in the an, for example, us.ng, for example, homologous recombination 
to stably insert a gene into the fungal host and/or to destroy the ability of the 
host cell to express a certain protein. 

IV - PREPARATI ON OF ANJTfRnnT^ 

In the following description, reference will be made to various 
methodologies well-known to those skilled in the art of immunology. Standard 
reference works setting forth the general principles of immunology include the 
work of Catty, D. (Antibodies. A Practical Approach, Vol. 1 IRL Press 
Washington. DC (1988)); Klein. J. (Immunology: The Science of Cell-Noncell 
Discrimination, John Wiley & Sons, New York (1982)); Kennefl. R., et al 
in Monoclonal Antibodies, Hybridoma: A New Dimension in Biological 
Analyses, Plenum Press, New York (1980)); Campbell, A. ("Monoclonal 
Annbody Technology," in: Laboratory Techniques in Biochemistry and 
Molecular Biology, Volume 13 (Burdon, R.. et al., eds.) Elsevier 
Amsterdam (1984)); and Eisen. H.N., in: Microbiology, 3rd Ed. (Davis. 
B.D., et al.. Harper & Row. Philadelphia (1980)). 

An antibody is said to be "capable of binding" a molecule if it is 
capable of specifically reacting with the molecule to thereby bind the molecule 
to the antibody. The term "epitope" is meant to refer to that portion of a 
hapten which can be recognized and bound by an antibody. An antieen may 
have one. or more than one epitope. An "antigen" is capable of inducing an 
animal to produce antibody capable of binding to an epitope of that antigen. 
The specific reaction referred to above is meant to indicate that the antisen 
will react, in a highly selective manner, with its corresponding antibody and 
not with the multitude of other antibodies which may be evoked by other 
antigens. 
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The term "antibody" (Ab) or "monoclonal antibody" (Mab) as used 
herein is meant to include intact molecules as well as fragments thereof (such 
as. for example. Fab and F(ab') : fragments) which are capable of binding an 
antigen. Fab and F(ab') : fragments lack the Fc fragment of intact antibody. 
5 clear more rapidly from the circulation, and may have less non-specific tissue 
binding of an intact antibody (Wahl et al.. J. Nucl. Med. 24:316-325 (1983)). 

The antibodies of the present invention are prepared by any of a variety 
of methods. Preferably, purified phytase or pH 2.5 acid phosphatase protein, 
or a fragment thereof, (treated or not treated with endoF or its equivalent to 
10 remove sugar moieties), is administered to an animal in order to induce the 
proration of sera containing polyclonal antibodies that are capable of binding 
sucn phytase or pH 2.5 acid phosphatase. 

Cells expressing phytase or pH 2.5 acid phosphatase protein, or a 
fragment thereof, or, a mixture of proteins containing phytase or pH 2.5 acid 
15 phosphatase or such fragments, can also be administered to an animal in order 
to induce the production of sera containing polyclonal antibodies, some of 
which will be capable of binding phytase or pH 2.5 acid phosphatase protein. 
If desired, such phytase or pH 2.5 acid phosphatase antibody may be purified 
from the other polyclonal antibodies by standard protein purification 
20 techniques and especially by affinity chromatography with purified phytase or 
pH 2.5 acid phosphatase or fragments thereof. 

A phytase or pH 2.5 acid phosphatase protein fragment may also be 
chemically synthesized and purified by HPLC to render it substantially free of 
contaminants. Such a preparation is then introduced into an animal in order 
25 to produce polyclonal antisera of high specific activity. 

Monoclonal antibodies can be prepared using hybridoma technology 
(Kohler etal.. Nature 256:495 (1975); Kohler etal. Eur. J. Immunol. 6:511 
(1976): Kohler et al.. Eur. J. Immunol. 6:292 (1976): Hammerling et al.. in: 
Monoclonal Antibodies and T-Cell Hybridomas. Elsevier. N.Y.. pp. 563-681 
30 (1981)). In general, such procedures involve immunizing an animal with 
phytase or pH 2.5 acid phosphatase protein antigen. The splenocytes of such 
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animals are extracted and fused with a suitable myeloma cell line. Anv 
suitable myeloma cell line may be employed in accordance with the present 
invention: however, it is preferable to employ the parent myeloma cell line 
(SP,0), available from the American Type Culture Collection. Rockville. 
Maryland. After fusion, the resulting hybridoma cells are selectively 
maintained in HAT medium, and then cloned by limiting dilution as described 
by Wands, J.R., etal.. Gastroenterology 50:225-232 (1981). which reference 
is herein incorporated. by reference. The hybridoma cells obtained through 
such a selection are then assayed to identify clones which secrete antibodies 
capable of binding the phytase or pH 2.5 acid phosphatase protein antigen. 

Through application of the above-described methods, additional cell 
lines capable of producing antibodies which recognizVepitopes of the phytase 
or pH 2.5 acid phosphatase protein can be obtained. 

Antibodies against both highly conserved and poorly conserved regions 
15 of the phytase or pH 2.5 acid phosphatase protein are useful for studies on the 
control of biosynthesis and catabolism of phytase or pH 2.5 acid phosphatase 
protein, and for studies wherein it is necessary to identify or quantitate the 
presence and/or of the protein antigen in a composition. 

V - PRODUCTIO N OF PHYTASE AND ACID PHOSPHATASE 

20 Tne De st phytase production levels are obtained when Trichoderma are 

transformed with linear DNA using the cbhl promoter (about 3,800 PNU/ml. 
see Table 8). About 3.500 and 3.600 PNU/ml culture medium was obtained 
with the best T. reesei ALK02221 and ALK0233 transformants containing no 
E. coli sequences. Both the phytase and the cbhl signal sequence seemed to 
work equally well and the same levels in phytase production could be achieved 
when using T. reesei ALKO 2221 or ATCC56765 as a host strain. In T. reesei 
ALKO 233 the level of phytase activity produced was higher when the phytase 
signal sequence was used. 



25 
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Phytase is expressed from the Trichoderma hosts of the invention into 
the supernatant of the culture medium. The amount of phytase in the culture 
medium is generally higher than any hitherto reported amount ot a 
heterologous protein that was expressed in Trichoderma. 
5 The spectrum of enzymes that accompany phytase in the Trichoderma 

strains of the invention is greatly different and advantageous over that of 
similar preparations of Aspergillus culture supernatants. Both endoglucanase 
and cellobiohydrolase .activities are generally substantially higher using the 
Trichoderma hosts of the invention. The glycosylation pattern of the phytase 
10 is also different when it is expressed from Trichoderma. resulting in a phytase 
protein that migrates as multiple bands on Western analysis. 

The best production of pH 2.5 acid phosphatase from ^Trichoderma 
transformants of the invention resulted in 240 APNU/ml culture medium, in 
shake flask cultivation and in lactose based medium. As with the phytase, both 
15 the acid phosphatase and the cbh\ signal sequence worked equally well. 

The compositions of the invention that contain phytase may be used 
directly for the removal of phytic acid, or inositol hexaphosphoric acid, from 
raw material, especially phytin-containing raw material, and especially plant 
material. Phytase removes the phosphate groups from phytic acid and destroys 
20 its ability to interfere with mineral absorption. When used as an animal feed 
additive, the phytase compositions of the invention release phosphate bound 
to phytin in grain and thus dramatically reduce the need for doses of additional 
phosphate in feed formulations and lessen environmental loads. 

The phytase and pH 2.5 acid phosphatase produced according to the 
25 invention may be purified by protein purification methods known in the art. 

Having now generally' described the invention, the same will become 
better understood by reference to certain specific examples that are included 
herein for purposes of illustration only and are not intended to be limiting 
unless otherwise specified. 
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EXAMPLF.S 



EXAMPLE i 
Method of Assay of Phvtase Activity 

Principle Phytase acts on phytate (inositol hexaphosphate) to release 
5 inorganic phosphate. The determination of released inorganic phosphate is 
based on the color formed by the reduction of a phosphomolybdate complex. 

Unit of Activity One phytase unit (PU) is the amount of enzyme which 
liberates, under standard conditions, 1 nmol of inorganic phosphate from 
sodium phytate in one minute. ^~ 

10 Assay conditions 

Substrate Sodium phytate 

PH 5.0 

incubation temperature 37°C ± 0.5°C 

incubation time 15 minutes 

15 Equipment 

Water bath 37 °c 

Water bath 50°C 
Spectrophotometer 
Test tube mixer (vortex) 
-° Phosphate Free Glassware 

Reagents All solutions are prepared in deionized water. Milli-Q or 
equivalent. 

1. Citrate Buffer (0.2 M, pH 5.0). Prepare 0.2 M solutions of 
both sodium citrate (QH,0 7 Na-2 H,0. 58.8 g/I. Merck 6448) and citric acid 

25 (C„H,CvH : 0. 42.0 g/1. Merck 244) in water. Adjust the pH of the citrate 
solution (1 liter) to 5.0 with 0.2 M citric acid (the consumption of citric acid 
solution should be about 385 ml). 

2. Substrate. Dissolve 1 .00 g of sodium phytate (Sigma P-3168) 
in about 70 ml citrate buffer. Adjust the pH to 5.0 with 0.2 M citric acid and 
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adjust the volume to 100 ml with citrate buffer. Fresh substrate solution must 

be prepared daily. 

3. 15% (w/v) TCA Solution. Prepare from trichloroacetic acid 

(Merck 807). 

5 4. 10% (w/v) Ascorbic Acid Solution. Prepare from ascorbic acid 

(Merck 127). Store under refrigeration. The solution is stable for seven days. 

5. 2.5% (w/v) Ammonium Molybdate Solution. Dissolve 2.5 g 
(NH 4 ) 6 Mo 7 0 :4 4H,0. Merck 1182) in water and make up to 100 ml. 

6. 1 M Sulfuric Acid. Add 55.6 ml of concentrated H,S0 4 (Merck 
10 731) to about 800 ml of water, with stirring. Allow to cool and make up to 

1000 ml with water. 

7. Reagent C. Mix 3 volumes of 1 M sulfuric acid wilh 1 volume 
of 2.5% ammonium molybdate, then add 1 volume of 10% ascorbic acid and 
mix well. Fresh reagent C must be prepared daily. 

15 Sample dilution Samples are diluted in citrate buffer. Make duplicate 

dilutions of each sample. In case of enzyme powder weigh accurately about 
250 mg of sample, dissolve in the buffer and fill to 25 ml in a volumetric 
flask, dilute further if necessary. 





Dilution table: 


20 


Estimated activity 


Recommended dilution 


Dilution factor 




PU/ml 








2000 


1 + 19 


20 




20000 


1 + 199 


200 




40000 


1 + 399 


400 


25 


100000 


1 + 999 


1000 




500000 


1 4- 4999 


5000 



Assav 



Hvdrolvsis Pipette 1.0 ml of sample dilution containing 20-190 PL) in 
two test tubes. Add 2.0 ml of 15% TCA to one of the tubes (blank) and mix. 
30 Put the tubes without TCA in a water bath at 37°C and let them equilibrate for 
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10 



15 



!0 



5 minutes. Using a stopwatch start the hydrolysis bv addin* sequentially at 
proper intervals 1.0 ml of substrate (equilibrated for about 10 minutes at 
37°C) to each tube and mix. After exactly 15 minutes incubation stop the 
reaction by adding 2.0 ml of TCA to each tube. Mix and cool to room 
temperature. Add 1.0 ml substrate to the blank tubes (kept at room 
temperature) also and mix. If precipitate occurs it must be separated by 
centnfugation for 10 minutes at 2000-g. 

Released orrhophosplw Pipette 0.4 ml of each sample after 
hydrolysis in test tubes. Add 3.6 ml of water to each tube. Add 4.0 ml of 
reagent C and mix. Incubate at 50°C for 20 minutes and cool to room 
temperature. Measure the absorbance against that of reagent blank (see below, 
at 820 nm. ^~ 

Standard Prepare a 9.0 mM phosphate stock solution. Dissolve and 
dilute 612.4 mg KH : P0 4 (Merck 4873, dried in desiccator with silica) to 500 
ml with water in a volumetric flask. Make the following dilutions in water 
from the stock solution and use these as standards. 



Dilution 


Phosphorus 
concentration nmol/ml 


Phvtase activity 
PU/ml* 


1:100 
1:200 
1:400 


90 
45 
22.5 


240 
120 
60 



fh„ „h« u K 5 f»ju»c auuviiy (ru/ml) is obtained by dividing 
the phosphorous concentration (nmol/ml) by the time of hydrolysis (15 
minutes, and mu nplying by four (total volume after hydrolysis' ac on 

Phrpto^r^ ^ ^ 10 (dilUti ° n bef ° re ana ' ySiS ° f in °^ ic 



Pipette 4.0 mi of each dilution to two test tubes. Pipette also 4.0 ml 
of water in one tube (reagent blank). Add 4.0 ml of reagent C and mix 
Incubate at 50°C for 20 minutes and cool to room temperature. Measure the 
absorbances at 820 nm against that of reagent blank. Prepare a standard curve 
by blotting the absorbances against phvtase activity (PU/ml). A new standard 
line must be constructed with each series of assavs. 
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Calculation Subtract the blank absorbance from the sample absorbance 
(the difference should be 0.100 - 1.000). Read the phytase activity (PU/ml) 
from the standard line and multiply by the dilution factor. To calculate the 
activity (PU/g) of enzyme powders the result (PU/ml) is further multiplied by 
5 25 (ml) and divided by the exact weight of the sample (g). 

Preparation of feed and other insoluble samples for phvta se analysis 
Weigh accurately about 2.5 g of ground sample in two 50 ml beakers. Add 
20.0 ml of citrate buffer. Mix using a magnetic stirrer for 30 minutes at room 
temperature. Transfer about 10 ml of each in centrifuge tubes and separate 

10 the solid matter by centrifugation for 10 minutes at 2000-g. Apply 2.5 ml of 
supernatant on PD-10 gel filtration columns (Sephadex G-25M>Pharmacia 17- 
0851-01) equilibrated with 25 ml citrate buffer. Discard the eluate. Then 
apply 3.5 ml citrate buffer on the column and collect the eluate in a graduated 
cylinder. Fill the volume to 5.0 ml with citrate buffer (dilution factor 2) and 

15 assay for phytase activity. The activity PU/g is obtained by multiplying the 
measured activity (PU/ml) by 40 (dilution factor • volume of extraction buffer) 
and dividing by the exact weight of sample (g). Reference: Chen et al. Anal. 
Chem. 25:1756-1758 (1956). 

EXAMPLE 2 

20 Assay of Acid Phosphatase Activity 

Principle . Acid phosphatase acts on p-nitrophenyl phosphate to release 
inorganic phosphate. The determination of released inorganic phosphate is 
based on the color formed by the reduction of phosphomolybdate complex. 

Unit of activity . One acid phosphatase unit (HFU) is the amount ot 
25 enzyme which liberates, under standard conditions. 1 nmol of inorganic 
phosphate from p-nitrophenyl phosphate in one minute. 



WO 94/03612 



PCT/FI93/00310 



- 32 - 



Assay conditions . 
Substrate 
pH 

Temperature 
incubation time 



2.5 

37°C ± 0.5°C 
15 min 



p-nitrophenyl phosphate 




37°C 
50°C 



Water bath 



Spectrophotometer 
Test tube mixer (vortex) 
Centrifuge (Hereaus Biofuge 
17S. 3090 or equivalent 
Phosphate Free Glassware 

Reagents . All solutions are prepared in deionized water, Milli-Q 



1. Glycine Buffer (0.2 M, pH 2.5) 

Dissolve 15.014 g glycine (Merck 4201) in about 800 ml of water. 
Adjust the pH to 2.5 with 1 M hydrochloric acid (consumption should 
be about 80 ml) and dilute to 1000 ml with water. 

2. Substrate (30 mM) 

Dissolve 1.114 g /vnitrophenyl phosphate (Boehringer. 738 352) in 
glycine buffer and adjust the volume to 100 ml with the buffer. Fresh 
substrate solution must be prepared daily. 

3. 15% (w/v) TCA Solution 

Prepare from trichloroacetic acid (Merck 807). 

4. 10% (w/v) Ascorbic Acid Solution 

Prepare from ascorbic acid (Merck 127). Store under refrigeration. 
The solution is stable for 7 days. 

5. 2.5% (w/v) Ammonium Molybdate Solution 

Dissolve 2.5 g (NHJ f ,M0 7 0v4H : 0. Merck n 82) in water and make 
up to 100 ml. 



15 



equivalent. 



6. 1 M Sulphuric Acid 

Add 55.6 ml of concentrated H 2 SO A (Merck 731) to about 800 ml of 
water, with stirring. Allow to cooi and make up to 1000 rnl with 
water. 

7. Reagent C 

Mix 3 volumes of 1 M sulphuric acid with 1 volume of 2.5% 
ammonium molybdate, then add 1 volume of 10% ascorbic acid and 
mix well. Fresh reagent C must be prepared daily. 
Sample dilution . Samples are diluted in glycine buffer. Make 
duplicate dilutions of each sample. In case of enzyme powder weigh 
accurately about 250 mg of sample, dissolve in the buffer and fill to 25 ml in 
a volumetric flask, dilute further if necessary. ^~ 



Dilution table: 



Estimated activity 


Recommended 


Dilution 


HFU/ml 


dilution 


factor 


20000 


1 + 19 


20 


200000 


1 4- 199 


200 


400000 


1 + 399 


400 


1000000 


1 + 999 


1000 


5000000 


1 + 4999 


5000 



Assav . 

Hvdrolvsis : Pipette 1.9 ml of substrate in two test tubes. Add 
2.0 ml of 15% TCA to one of the tubes (blank) and mix. Put the tubes 
without TCA in a water bath at 37°C and let them equilibrate for 5 min. 
Using a stopwatch start the hydrolysis by adding sequentially at proper 
intervals 0.1 ml of enzyme dilution to each tube and mix. After exactly 15 
min incubation stop the reaction by adding 2.0 ml of TCA to each tube. Mix 
and cool to room temperature. Add 0. 1 ml of sample to the blank tubes (kept 
room temperature) also and mix. If precipitate occurs it must be separated 
by centrifugation for 10 min at 2000-g. 

Released orthophosphate : Pipette 0.4 ml of each sample after 
hydrolysis in test tubes. Add 3.6 ml of water to each tube. Add 4.0 ml ot 
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reagent C and mix. incubate at 50°C for 20 min and cool to room 
temperature. Measure the absorbance against that of reagent blank (see below ) 
at 820 nm. 

Standard. Prepare a 9.0 mM phosphate stock solution. Dissolve and 
5 dilute 612.4 mg KH,P0 4 (Merck 4873. dried in dessicator with silica) to 500 
ml with water in a volumetric flask. Make the following dilutions in water 
from the stock solution and use these as standards. 



Dilution 


Phosphorus concentration 
nmol/ml 


Acid phosphatase activity 
HFU/ml* 


1:100 


90 


2400 


1:200 


45 


1200 


1:400 


22.5 


600 



The corresponding acid phosphatase activity (HFU/ml) is obtained by 
dividing the phosphorus concentration (nmol/ml) by the time of hydrolysis (15 
min) and multiplying by 40 (total volume after hydrolysis reaction / sample 
volume) and by 10 (dilution before analysis of inorganic phosphorus). 



Pipette 4.0 ml of each dilution to two test tubes. Pipette also 4.0 ml 
of water in one tube (reagent blank). Add 4.0 ml of reagent C and mix. 
Incubate at 50°C for 20 min and cool to room temperature. Measure the 
absorbances at 820 nm against that of reagent blank. Prepare a standard curve 

20 by blotting the absorbances against acid phosphatase activity (HFU/ml). A 
new standard line must be constructed with each series of assays. 

Calculation. Subtract the blank absorbance from the sample absorbance 
(the difference should be 0.100-1.000). Read the acid phosphatase activity 
(HFU/ml) from the standard line and multiply by the dilution factor. To 

25 calculate the activity (HFU/gj of enzyme powders the result (HFU/ml) is 
further multiplied by 25 (ml) and divided by the exact weight of the sample 
(g). 

Preparation of feed and other insoluble samples for acid phosphatase 
analysis. Weigh accurately about 2.5 g of ground sample in two 50 ml 
30 beakers. Add 20.0 ml of glycine buffer. Mix using a magnetic stirrer for 30 
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min at room temperature. Transfer about 10 ml of each in centrifuge tubes 
and separate the solid matter by centrifugation for 10 min at 2000-g. Apply 
2.5 ml of supernatant on PD-10 gel filtration columns (Sephadex G-25M. 
Pharmacia 17-0851-01) equilibrated with 25 ml glycine buffer. Discard the 

5 eluate. Then apply 3.5 ml of glycine buffer on the column and collect the 
eluate in a graduated cylinder. Fill the volume to 5.0 ml with glycine buffer 
(dilution factor 2) and assay for acid phosphatase activity. The activity HFU/g 
is obtained by. multiplying the measured activity (HFU/ml) by 40 (dilution 
factor • volume of extraction buffer) and dividing by the exact weight of 

10 sample (g). Reference: Chen, P.S.. etaL.Anal. Chem. 28:1756-1758 (1956). 

EXAMPLE 3 ^ 
Purification of Phvtase and pH 2.5 Acid Phosphatase 

For reference to how the skilled artisan would purify phytase and pH 2.5 acid 
phosphatase, the following are provided. 

5 I. PHYTASE 

Enzvme purification . Steps were done at 4 to 8 °C unless otherwise 
stated. The starting material was the cell-free culture medium concentrate 
produced by Aspergillus niger var. awamori ALKO 243. 

Ammonium sulphate precipitation . The culture filtrate concentrate (990 
10 ml) was kept on an ice bath and 0.436 g ammonium sulphate per ml was 
added (70% saturation). After 30 minutes the precipitate was separated by 
centrifugation for 15 minutes at 10000*g and discarded. 

Hydrophobic interaction chromatography . The supernatant (1070 ml) 
was applied to an Octyl-Sepharose CL-4B (Pharmacia) column (5 cm x 17 cm) 
15 equilibrated with a solution containing 0.436 g (NH 4 ) : S0 4 per ml of 20 mM 
bis-Tris/HCl (pH 6.2). The column was washed with 500 ml of the 
equilibration solution and then developed with a linear gradient of 500 ml 
containing 70-*0% amonium sulfate in 20 mM bis-Tris/HCl (pH 6.2). 
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Fractions of 10 ml were collected and analyzed for phytase and acid 
phosphatase activity. Most of the phytase activity eluted in the beginning of 
the gradient. The fractions were pooled for the next step. The fractions 
eluting after phytase activity and containing most of the acid phosphatase 
activity were pooled for acid phosphatase purification (see below). 

Anion exchange chromatography The pooled phytase fractions (129 
ml) were concentrated by ultrafiltration using an Amicon PM 10 membrane. 
The residual ammonium sulphate was removed by PD 10 (Pharmacia) gel 
filtration columns equilibrated with 50 mM bis-Tris/HCl (pH 6.2). The 
sample, in 24.5 ml, was applied to a DEAE-Sepharose (Pharmacia) column 
(5 cm x 7 cm) equilibrated with 50 mM bis-Tris/HCl (pH 6.2). The column 
was washed with the equilibrium buffer (100 ml) and developed by a linear 
gradient of 200 ml containing 0-*0.5 M NaCl in equilibrium buffer. 

Gel filtration. The pooled active fractions were concentrated using a 
15 Centricon -30 microconcentrator to a total volume of 600 /xl. Portions of 100 
t*\ were run at about 23 °C and 0.3 ml/min through a Superose 12~HR 10/30 
HPLC column (Pharmacia) equilibrated with 50 mM bis-Tris/HCl (pH 6.2). 

Cation exchange. The pooled active fractions were transferred to 50 
mM sodium formiate (pH 3.8) using a Centricon -30 microconcentrator. The 
20 sample was applied in two portions of 2 ml to a Mono S HR 5/5 FPLC 
column (Pharmacia) equilibrated with 50 mM sodium formiate (pH 3.8) at 
about 23°C. The column was washed with the equilibration buffer (10 ml) 
and the bound protein was eluted at 60 ml/h with a linear gradient of 20 ml 
containing 0-*430 mM NaCl in equilibration buffer. 



25 II. ACID PHOSPHATASE 

Gel filtration. The pooled fractions containing most of the acid 
phosphatase activity from the hydrophobic interaction chromatography step 
were concentrated by ultrafiltration using an Amicon PM 10 membrane. The 
concentrated sample (25 ml) was run through a Sephacryl S-200 (Pharmacia) 
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column (2.6 cm x 94 cm) equilibrated with 50 mM bis-Tris/HCl (pH 6.2) at 
20 ml/h. 

Anion exchange chromatography . The pooled fractions (48 ml) were 
applied to a DEAE-Sepharose (Pharmacia) column (5 cm x 7 cm) equilibrated 

5 with 50 mM bis-Tris/HCl (pH 6.2). The column was washed with 100 ml of" 
equilibration buffer and developed with a linear gradient of 200 ml containing 
0-»0.5 M NaCl in equilibration buffer. 

Anion exchange chromatography . The pooled active fractions were 
concentrated and transferred to 20 mM bis-Tris/HCl (pH 6.0) by ultrafiltration 

10 using an Amicon PM 10 membrane. The sample was run in four portions of 
3.5 ml on Mono Q HR 5/5 HPLC column (Pharmacia) equilibrated with 20 
mM bis-Tris/HCl (pH 6.0) at about 23°C and 60 ml/h. The column was 
washed with 10 ml of the equilibrium buffer and the bound protein was eluted 
with a linear gradient of 20 ml containing 0-*350 mM NaCl in equilibrium 

15 buffer. 

Gel filtration . The active fractions were pooled, concentrated and 
transferred to 20 mM bis-Tris/HCl (pH 6.2) containing 150 mM NaCl with 
Centricon -30 microconcentrator to total volume of 400 fi\. Portions of 100 
n\ were run at about 23°C and 18 ml/h through a Superose 12 HR 10/30 

20 HPLC column (Pharmacia) equilibrated with the sample buffer. 

Anion exchange chromatography . The pooled active fractions were 
transferred to 20 mM histidin/HCl (pH 5.8) with a PD 10 gel filtration 
column. The sample was run in four portions of 1 ml on Mono Q HR 5/5 
HPLC column (Pharmacia) equilibrated with the sample buffer at about 23 °C 

25 and 60 mi/h. The column was washed with 5 ml of the sample buffer and the 
bound protein was eluted with a linear gradient of 20 ml containing 0-*350 
mM NaCl in equilibrium buffer. 
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Table 1. 



Seer 



Summary of purif 



Cul cure 
filtrate 



Ajrunonium 
sulfate 
superna tant 

Octyl 
Sephar oae 

DEAE- 
Sepharoa e 

Superose 12 

Mono S 



ication of phy 



tase from As 



Total 
activity 



44 86680 



3771750 

1765881 

1453470 

1010888 
827S66 



protein 
l ma : 



2119 



1263 

32.3 

8 . 4 

5 . 7 
3 . 0 



Specific 
activity 
** PU / rr.a • 



2986 

54671 

173032 

177349 
27588- 



Yield 



.00 



84 . 1 

39.4 

32.4 

32.5 
18 .4 



Purif icaci 
on (fold/ 



26 



82 



Summary of purification of ac id nhnp^ - 
Aspergillus niver Phosphatase from 



Step 



Culture 
filtrate 

Ammonium 
sulphate 
supernatant 



Octyl 
Sephar ose 

Sephac rvl 

DEAE- 
Sepharose 



Total 
activity 
(HFU) 

116S23000 
88275000 



Total 
protein 

t'mg; 

2119 
1263 



68296470 S83 



52237600 
46127692 



97.9 
54.6 



Specific 
act ivity 
(HrU/mg ) 

54990 
69893 



Yield 



100 
75. 8 



117147 

533581 
844830 



Purifica- 
tion 
( fold) 



1 .3 



58.6 

44.fi 
39.6 



2 . 1 

9.7 
15.4 



Mono Q 



19326753 



2 .28 



Superose 



16876978 



nd 



5S923C3 | 16.6 
nd 



107 



Mono Q 



15197050 



na s not determinea 



6 9 07750 



14 . S 



nd 



13 . 0 



126 



EXAMPLE 4 

Characterization of P„rifi» d Phvfw anH 
PH 2.5 Acid Phosp hate PepriHft rwer;^ 

Native purified phytase (70 n) in 50 mM Tris-HCI pH 7 9 was 
due* with 2% ,w/w. trypsin (TPCK-trea.ed. Sigma, for 2 hours a. 37°C 
and then wi,h a funher 1% ,w/ w , lrypsin for hours Qne |m Qf ^ 
punhed phosphatase in 100 mM Tris-HCI pH 8.0 was treated with 2% (w/w) 
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trypsin for 20 hours at 37°C and then with a further 2% (w/w) trypsin for 6 
hours. The peptides were purified as described below. 

Another lot of purified native phosphatase was alkylated using 4-vinyl 
pyridine as follows: To lyophilized phosphatase (75 fig) was added 40 fi\ 0.5 

5 M Tris-HCl pH 7.5 containing 6 M guanidium hydrochloride, 2 mM EDTA 
and 34 mM DTT. After addition of 1 y\ 4-vinyl pyridine (Sigma), the 
reaction mixture was kept at room temperature (22°C) for 1 hour. The 
reaction was stopped by addition of 10 ftl 1.4 M DTT. Alkylated phosphatase 
was then purified on HPLC with a C-l reverse-phase column (TSK TMS 250; 

10 0.46 x 4 cm) using a 20% to 70% ACN/0.06% TFA gradient (80% to 30% 
0.1 % TFA) in 30 minutes. The fractions absorbing at 218 nm were pooled 
and evaporated in a Speed- Vac vacuum centrifuge. The dried sample was 
resuspended in 60 pi 70 mM Tris-HCl pH 9.1 and digested-with 2% (w/w) 
lysylendopeptidase C (Wako Chemicals) for 2 hours at 37° C. After addition 

15 of a further 2% (w/w) lysyl endopeptidase C, the incubation at 37 °C was 
prolonged to 26 hours. The peptides were purified as described below. 

Peptide purification and amino terminal sequencing 

The peptides obtained by digestions were separated by HPLC on a C- 
18 reverse-phase column (Vydac 218 TP B5; 0.46 x 25 cm) with a 90 minute 
20 gradient from 0 to 60% ACN/0.06% TFA (100 to 40% of 0.1% TFA). 
Absorbance at 218 nm was used for detection of peptides. 

Amino terminal sequencing of the purified peptides, as well as the 
native proteins, was done by degrading them in a gas-pulsed-liquid-phase 
sequencer (Kalkkinen and Tilgmann, Journal of Protein Chemistry 7:242-243 
25 (1988)). The released PTH-amino acids were analyzed on-line by using 
narrow-bore reverse-phase HPLC. 
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Carboxv t erminal sequencing of phvtase 

One lot of purified phytase (53 ng) was digested with carboxypeptidase 
Y (Sigma, 0.6 U) in 50 mM sodium acetate P H 5.6 containing 10% urea and 
0.05% SDS at room temperature (22°C). Samples of the digestion were 
5 withdrawn at various time points. These were dried in a Speed-Vac vacuum 
centrifuge and derivatized with phenylisothiocyanate (PITC) according to the 
amino acid analyzing kit Pico-Tag (Waters association). Analysis of the 
derivatized amino acids was performed by reverse-phase HPLC with the Pico- 
Tag C-18 column, and quantified by identically derivatized amino acid 
10 standards. 

Results and Discussion ^~ 

Sequences could be extracted from the peptides showing "double 
sequences" (for both phytase and phosphatase, Tables 3 and 4) since they were 
quantitatively different and/or the other sequence was already known from 
15 peptides sequenced. Native phosphatase seemed to be somewhat resistant to 
lysylendopeptidase C digestion. After alkylation however, peptides of 
phosphatase were nicely obtained with lysyl-endopeptidase C. 

The amino terminal sequence obtained from phytase (Nphy. #1081: 
rSEQ. ID NO.:50:]) was similar, but not identical, to the amino terminal 
20 sequence of A. ficuum phytase (LAVPASRNQSSGDT) [SEQ ID No.: 17:1 
reported by Ullah, A. HJ. Prep. Biochem. /5:459-471 (1988). Peptides 
resulting from trypsin digestions are shown in Table 3. One peptide ( 10 phy) 
[SEQ ID NO.:23:] had identical sequences with the internal peptide of A. 
ficuum phytase (MMQCQAEQEPLVRVLVNDR); Ullah, A.H.J. Prep. 
Biochem. 78:459-471 (1988). Carboxyterminal sequencing of phytase gave the 
sequence XSA-OH. 

No results were obtained from amino terminal sequencing of native and 
alkylated phosphatase (Table 4). One peptide (7Lpho, #817 [SEQ. ID 
NO.:53:]) was. however, identical with the amino terminal sequence from 



25 
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A. ficuum acid phosphatase (pH optimum 2.5: FSYGAAIPQSTQEK 
QFSQEFRDG) published by Ullah. A.H.J, and Cummings, B.J.. Prep. 
Biochem. 77:397-422 ( 1987) and another ( 10 Lpho. #941 [SEQ. IDNO.:54:D 
seems to be a continuation of this. Peptide 3Tpho (peptide #1 106 in Table 4: 
5 SEQ ID No.:61:) could also be a continuation of peptide HLpho (peptide 
#943-2 in Table 4; SEQ ID No.:60:) since it has four overlapping amino 
acids: FSSG. 

Peptide lLpho (peptide #816 in Table 4: SEQ ID No. :57:) contains 
the active site consensus sequence RHGXRXP [SEQ ID No. : 18:1 of phytases 

10 and phosphatases proposed by Ullah. A.H.J, et al.. Biochem. Biophys. Res. 
Comun. 1 75:45-53 (1991). The peptide was highly homologous, but not 
identical. One peptide of phytase (#675, [SEQ. ID NO.:37:], LKDPR) again 
contained part of the KDPRA [SEQ ID No.: 19:] homologic sequence between 
A. ficuum phytase and different phosphatases reported by Ullah, A.H.J, et al.. 

15 Biochem. Biophys. Res. Comun. 775:45-53 (1991). 

The results indicate that A. niger phytase is homologous to A. ficuum 
phytase, but not identical. The same conclusion is reached in the case of acid 
phosphatase (pH optimum 2.5). 
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Table 4. pH 


2.5 acid phosphatase peptide sequences generated by either 


trypsin (T) or endoproteinase Lys-C digestion (L) of purified enzvme 


Cor 


responding nucleotide positions are also listed. 


Peptide Number 


Peptide Seauenee 


1 

j Nucleotide Position 


N -terminal 


FSYGAAIPOSTOFK 


I9j...2_)4 


#817:7Lpho 






[SEQ ID No.:53:| 








OFSOEFRDGY 


235. ..264 


#941: lOLpho 




ISEQ ID No.:54:) 






#938: 2Lpho 


YGGNGPY 


-ioU...300 




[SEQ ID No.:55:[ 








VSYGIA 


*2 1 A ~> T7 

J IU. j27 


#1111: 4Tpho 






[SEQ ID No.:56:] 








RHGERYPSPSAGK 


^76. ..414 


#816: lLpho 




[SEQ ID No.:57:] 








DIEEALAK 


415. ..438 


#847: 5Lpho 




[SEQ ID No.:58:] 








ARYGHLWNGFT 


595. ..627 


#943-1: llLpho 




[SEQ ID No.:59:| 






#943-2: 1 ILpho 


VVPFFSSG 


628. ..651 




lony I1J INO..oU.| 








FSSGYGR 


640. ..660 


#1106: 3Tpho 




[SEQ ID No.:61:] 








QLPQFK 


826.. .843 


#11 10-1: 6Tpho 




[SEQ ID No.:62:| 








VAFGNPY 


1384... 1404 


#1108: 9Tpho 




ISEQ ID No.:63:[ 
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EXAMPLE 5 
The Cloning and Sequencing of the 
pH 2.5 Optimum Acid Phosphatase Gene 
from Aspergillus niger 

5 I. SUMMARY 

The gene for pH 2.5 optimum acid phosphatase has been cloned and 
sequenced from Aspergillus niger. Translated nucleotide sequence yielded a 
polypeptide of 479 amino acids for the pH 2.5 acid phosphatase. The gene for 
this protein was isolated using oligonucleotide probes based on the peptide 
10 sequence of the purified protein. 

II. EXPERIMENTAL AND DISCUSSION 

A. Design of oligonucleotide probes. 

Isolation of the gene encoding pH 2.5 acid phosphatase (AP) was made 
through hybridization of degenerate oligonucleotides designed from peptide 

15 sequences. Several internal peptide fragments had been isolated previously 
and sequenced from purified pH 2.5 AP from A. niger var. awamori strain 
ALKO 243 (ATCC 38854) as described earlier in this patent. 

A 17mer degenerate oligonucleotide, PHY-31, was designed from acid 
phosphatase peptide #816 (lLpho in Table 4 [SEQ. NO.:57:]). Through the 

20 incorporation of a neutral inosine. one perfect match out of 64 possible 
combinations exists in PHY-31. The nucleotide sequence of oligonucleotide 
PHY-31 and corresponding peptide sequence is shown in Figure 1. 
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B. Hybridization specificity of the oligonucleotide probes 

In order to evaluate the specificity of the degenerate oligonucleotides, 
they were end labelled with [7"P]-ATP to a high specific activity using E. coli 
polynucleotide T4 kinase (BRL) and used to probe total genomic DNA from 
5 ALKO 243. Genomic DNA was isolated by a neutral lysis method. Briefly, 
finely ground frozen dried mycelia was lysed with a 4% SDS-TE buffer. Cell 
debris was removed and supernatant was removed and extracted twice with an 
equal volumn of Tris-saturated phenolxhloroform (1:1). Genomic DNA was 
precipitated with NH 4 OAC and EtOH. Pelleted DNA was purified by 

10 ultracentrifugation through CsCl and recovered as described by Maniatis et al. 
Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY, USA (1982)). Hybridization to genomic DNA with 
[7 32 P] ATP labelled degenerate oligonucleotides Maniatis et al. Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 

15 Harbor, NY, USA (1982) was done at 42°C over night on filters in 
oligonucleotide hybridization solution (6X SSPE, 0.5% SDS, 3X Denhardts, 
100 /-ig/ml tRNA). Non-specific binding was removed by washing the filters 
twice for 30 minutes at room temperature with 2XSSC, 0. 1 % SDS, and once 
for 5 minutes at 42°C in fresh solution. Overnight exposure on Kodak X- 

20 Omat AR film with intensifying screens revealed positively hybridizing bands. 

A. niger ALKO 243 genomic DNA was probed with pH 2.5 
oligonucleotide PHY-31. Hybridization was performed as in genomic 
hybridization. Among the oligonucleotides for pH 2.5 AP, only PHY-3 1 gave 
relatively specific hybridization to genomic DNA. Hybridization to one 

25 predominant and one minor band indicated sufficient specificity to use for 
screening the libraries. 
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C. Isolation and characterization of the pH 2.5 Acid Phosphatase 
gene 

Genomic DNA was partially digested with Sau3& in order to produce 
fragments 10-23kb in length. Digested DNA was ligated to BamHl-cui 
5 dephosphorylated Lambda Dash II vector arms (Stratagene). The ligated DNA 
was packaged in vitro using Gigapack Gold packaging extracts (Stratagene). 
Packaged phage was used to infect E. coli strain P2392. The lambda library 
was screened with oligonucleotide PHY-31 for the pH 2.5 AP gene under the 
conditions established with genomic hybridizations in section (B). Twelve 

10 hybridizing plaques were picked for further characterization. Bacteriophage 
DNA isolated from each of the candidates was digested with restriction 
endonucleases and probed with either PHY-31 or a mixture of PHY-34 and 
PHY-35 which were derived from an independent pH 2.5 AP peptide (Figure 
1). One of the clones, AP99, contained a 2.1 kb Sph\ fragment previously 

15 identified in genomic Southern analysis, that hybridized strongly to both 
probes. Strong hybridization to two oligonucleotides derived from different 
peptide sequences suggested that AP99 contained pH 2.5 AP coding 
sequences. This 2.1 kb Sph\ fragment was therefore subcloned into M13mpl8 
and M13mpl9 for sequencing. Translation of the nucleotide sequence of this 

20 subclone revealed the peptide sequences including the N-terminal peptide 
(Table 4). Immediately upstream of the N-terminal peptide is a typical fungal 
secretion signal sequence beginning with a methionine initiation codon at 
position 136. All of the peptide sequences were present in a single ORF except 
#1108 (peptide 9Tpho in Table 4 [SEQ. ID NO.:63:]) which begins at 

25 nucleotide position 1384. Termination codons were identified in all three 
reading frames between nucleotides 1 151 and 1384. These results necessitated 
the inclusion of an intron(s) in the 3' portion of the gene. 

Identification of intron boundaries was made through the isolation and 
sequencing of pH 2.5 AP cDNA. The 3' region of pH 2.5 AP gene was 

30 isolated from the corresponding cDN A by PCR amplification using pH 2.5 AP 
specific primers. A. niger var. awamori ALKO 243 was grown in RNA broth 
media consisting per liter of 2.0% corn starch (Sigma), 1.0% protease peptone 
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(Difco). 30.0g glucose. 5.0g NH.NO,. 0.5g MgS0 4 7H,0. 0.5g KC1. 0.183g 
FeSCV7H : 0. Total RNA was isolated essentially by the LiCl precipitation 
method ofMcAdaand Douglas (McAda. P.C. eiai. Meth. Enzymol. 97:321- 
344 (1983)). Polyadenylated messenger RNA was affinity purified from total 
RNA by the use of oligonucleotide(dT)-ceIlulose columns (Pharmacia) as 
specified by the manufacturer. Oligonucleotide PCR primers UPPHOS 
(5 'GAATTCCGAGTCCGAGGTCATGGGCGCG-3 ') [SEQ ID No.:66:] and 
DOWNPHOS (5'-GAATTCCCGGGACCTACCCCTCTGCAT-3') [SEQ ID 
No.:16:] were synthesized according to genomic sequences with flanking 
£coRI restriction sites. UPPHOS and DOWNPHOS are inversely oriented 
and are separated by 978 bases in the genomic clone. First strand synthesis 
was performed with the 8MB cDNA kit according to the manufacturer- s 
recommendations with 1.0 M g mRNA and DOWNPHOS. PCR amplifiction 
of the cDNA-mRNA complex with oligonucleotide primers UPPHOS and 
DOWNPHOS yielded a specific product of approximately 850 bps. PCR 
amplification of pAP-1 plasmid DNA with the same primers yielded the 
expected product of 1006 bps. Gel purified cDNA PCR product was cut with 
EcoRl and subcioned into pUC-18 for double-stranded sequencing using the 
United States Biochemical Sequencase II kit. The primers amplified an 850 
bp fragment from the cDNA and the expected 1006 bp fragment from cloned 
genomic DNA. Sequencing of the amplified cDNA fragment revealed the 
presence of three short introns. each exhibiting consensus fungal donor, lariat 
and acceptor sequences. The coding sequence is derived by splicing the 
nucleotides 136-916. 971-1088. 1141-1245, and 1305-1740. The resulting 
translated sequence codes for a protein of 479 aa as shown in Figure 2. 

The pH 2.5 AP polypeptide predicted from the nucleotide sequence has 
a calculated M r of 52.678. The 2.1 kb Sphl fragment in pUC18 (pAP-1) 
contained 135 bp of upstream pH 2.5 AP sequence. 
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EXAMPLE 6 

Aspergillus nieer Phvtase Production in Trichoderma reesei 
II. EXPERIMENTAL PROTOCOLS 

1. Bacterial strains, phage and plasmids 
5 For subcloning and sequencing, the E. coli strains DH5a (Hanahan, 

D., "Techniques for transformation of E. coli." in DNA Cloning, vol. 1, 
Glover, D.M., ed.. IRL Press, Oxford, pp. 109-135 (1985); Bethesda 
Research Laboratories, Gaithersburg, MD, USA) and XL-l-Blue (Bullock. 
W.O., etaL. BioTechniques 5:376-378 (1987); Stratagene, La Jolla. CA. 

10 USA) were used. E. coli Y1090 (r) (Huynh, D.S., et a/., "Constructing and 
screening cDNA libraries in Xgtl and Xgtll/ DNA Cloning, vol. 1, Glover, 
D.M., ed., IRL Press, Oxford, pp. 49-57 (1985); Promega Biotec Protoclone 
GT System, Madison, WI, USA) was used as a host in Xgtl 1 phage growing. 
Aspergillus niger var. awamori ALKO 243 (ATCC 38854) was used 

15 as a donor of the phytase gene. T. reesei strains ATCC56765 (RutC-30), 
ALKO 233 (VTT-D-791256, Bailey and Nevalainen, Enzyme Microb. TechnoL 
3:153-157 (1981)) and ALKO 2221 were used as recipients for the phytase 
gene. ALKO 2221 is a low aspartyl protease mutant derived from the strain 
ALKO 233 by UV-mutagenesis. 

20 The phage Xgtl 1 (Promega) was used for making the gene library. The 

phages were grown by the standard methods described by Silhavy et aL 
(Silhavy, T.J.. et aL. Experiments with Gene Fusions. Cold Spring Harbor 
Laboratory, Cold Spring Harbor. NY, USA (1984)). 

As vectors for subcloning, pUC9 (Boehringer, Mannheim. FRG) and 

25 pALK307, a derivative of pIBI76 (IBI, New Haven. Conn., USA) were used. 
To obtain pALK307. an approximately 940 bp NaeUPvul fragment (actually 
941 bp) has been deleted from pIBI76. This the only change in pIBI76. The 
plasmid pAMHUO (Nevalainen. H.. et aL. "The molecular biology of 
Trichoderma and its application to the expression of both homologous and 

30 heterologous genes," in Molecular Industrial Mycology. Leong and Berka. 
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52 



10 



15 



>0 



) 



eds.. Marcel Dekker Inc.. New York. pp. 129-148 (1991)) contains the 
Tnchoderma reesei cbh\ promoter and terminator areas. The plasmid p 3S R^ 
'Kelly and Hynes. EMBO J. 4A15-419 (1985)) contains the Aspergillus 
mdulans acetamidase gene. P 3SR2 has been kindly donated by Dr. M. Hynes 
5 University of Melbourne. Australia). 



2. Growth media and culture conditions 

E. coli cultivations were carried out at 37°C overnight and cultivations 
of filamentous fungi at 30°C for 5 to 7 days for enzyme production and for 
2 days when mycelia was grown for DNA isolation. 

E. coli were grown in L-broth (Maniatis, T., etal.. Molecular 
Cloning, A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Sprint 
Harbor, NY, USA (1982)) supplemented with ampiclllin (50 - 100 M g/ml) 
when needed. PD agar slants (Potato Dextrose broth by Difco, Detroit, MI. 
USA) were used for growing the Aspergillus and the Tnchoderma strains 
Aspergillus niger ALKO 243 mycelium for DNA extraction was grown in 
complete Aspergillus medium containing 2 % (w/v) malt extract (Difco), 0. 1 % 
(w/v) Bacto-peptone (Difco) and 2 % (w/v) glucose. The plates and media for 
T. reesei transformations were as in Penttila et al. (Penttila, M., et al.. Gene 
61: 155-164 (1987)). The transforms were purified on selective acetamide - 
CsCl medium (Penttila. M., et al.. Gene 67:155-164 (1987)) before 
transferring to PD slants. 

For plate screening of high phytase producers, T. reesei clones 
transformed with the cbh 1 promoter/phytase fusion were grown for 3 to 5 davs 
on Trichoderma minimal medium plates with no glucose and supplemented 
with 1 % sodium phytate (Sigma. St. Louis, MO, USA), 1 % Solka Floe and 
1% proteose peptone (Difco). When the construct containing the phvtase 
promoter was used for transformation, screening of high phytase producers 
was carried out on plates containing 1 % sodium phytate and 1 % proteose 
peptone but no sodium phosphate. 

For phytase production A. niger ALKO 243 was grown for 5 days in 
a soy flour medium containing glucose and mineral salts (all from Merck. 
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Darmstadt. FRG); the pH was adjusted to 5.0. For phytase expression from 
the cbh\ promoter. T. reesei transformants were grown for 7 days (250 rpm) 
in a lactose based cultivation medium. For growing the T. reesei transformed 
with the fragment containing the phytase promoter. Trichoderma minimal 
5 medium was supplemented with 50 g/1 soy flour and no sodium phosphate was 
added. 

The contents of the soy flower medium are as follows (per liter): 50 
g of soy flower, 30 g glucose, 5.0 g NH 4 NO ? , 0.5 g MgS0 4 7H : 0, 0.5 g 
KC1, 0.183 g FeS0 4 '7H : 0, atpH 5.0. The lactose based cultivation medium 
10 contains: 4% whey, 1.5% complex nitrogen source, 1.5% KH : P0 4 , 0.5% 
(NH 4 ) 2 S0 4 , at pH 5.5. 

3. DNA preparations ^ 
Plasmid DNA from E. coli (large scale) was isolated by using Qiagen 
columns (Diagen GmbH, Dusseldorf, FRG) according to the manufacturer's 
15 protocol. For rapid screening the method of Holmes and Quigley {Anal. 
Biochem. 114: 193-197 (1981)) was used. Chromosomal DNA from Aspergillus 
was isolated from lyophilized mycelia as described in Clements and Roberts 
{Curr. Genet. 9:293-298 (1986)) and in Raeder and Broda (Lett. AppL 
Microbiol. 7:17-20 (1985)). 

20 4. Cloning procedures 

The standard DNA methods described by Maniatis et al. (Molecular 
Cloning, A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, USA (1982)) were mainly used. The restriction enzymes. T4- 
DNA-ligase, Klenow fragment of DNA polymerase 1, T4 DNA polymerase, 

25 polynucleotide kinase and £coRI methylase ured in the DNA manipulations 
were from Boehringer (Mannheim. FRG) and .*ew England Biolabs (Beverly. 
MA. USA). Mung bean nuclease was from BRL (Gaithersburg, MD, USA) 
and Exolll from Pharmacia (Uppsala. Sweden). Each enzyme was used 
according to the supplier's instructions. 
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For making the gene bank the chromosomal DNA was partiallv 
digested with Haelll. EcoRl methyiase treatment, size fractionation and 
packaging were done as in Paioheimo et al. (Paioheimo, M., et al.. AppL 
Microbiol. BiotechnoL 36:584-591 (1992)). Fragments of a size of 2-8 kb 
were used for construction of the gene bank. 

Subcloning into the plasmid vector was done by using standard DNA 
methods (Maniatis, T. , et al. , Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY, USA (1982)). DNA 
fragments for cloning or transformations were isolated from low-melting-point 
agarose gels (FMC Byproducts, Rockland. ME, USA) by the freeze-thaw- 
phenol method (Benson, S.A.. Bio/Techniques 2:66-68 (1984)) or by using the 
GeneClean® or Mermaid™ Kits (BIO 101 Inc., La Jolla, CA,.USA) according 
to the supplier's instructions. 

Sequencing was carried out directly from the plasmids by the Sanger 
method (Sanger, F., et aL, Proc. Natl. Acad. Sci. USA 74:5463-5467 (1977)) 
by means of SP6, T7, pUC/M13 primers and extension primers and the 
Promega Sequenase sequencing kit (United States Biochemical Corporation, 
Cleveland, OH, USA). Fusions between the cbh\ promoter and the phytase 
gene were sequenced by automated sequencer (Applied Biosystems 373A, 
Foster City, CA. USA) using Taq DyeDeoxy™ Terminator Cycle Sequencing 
Kit (Applied Biosystems). The oligonucleotides used were synthesized by an 
Applied Biosystems 381 A Synthesizer except the pUC primers that were 
purchased from the United States Biochemical Corporation. 

DNA probes were labeled by using the non radioctive DIG-DNA 
Labelling and Detection Kit by Boehringer according to the supplier's 
instructions. 

Hybridizations were done at 68°C as in supplier's instructions 
(Boehringer). Amersham's Hybond N nylon filters were used in the plaque 
screenings and in the Southern blot hybridizations. 

When the plaques were screened with an antiserum, a phytase specific 
polyclonal antiserum KH1236 was used. KH1236 was made against purified 
and deglycosylated phytase preparation (M. Turunen. Alko Ltd.) in the 
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National Public Health institute (Helsinki. Finland). Anti-rabbit-lgG alkaline 
phosphate conjugate and color development substrates from ProtoBlot™ 
Immunoblotting system (Promega) were used to detect the immunocomplexes. 

5 . Transformations 

5 E. coli strains were transformed according to the method of Hanahan 

("Techniques for transformation of E. coli." \r\DNA Cloning, vol; I. Glover. 
D.M., ed., IRL Press, Oxford, pp. 109-135 (1985)), and 7. reesei strains as 
in Penttila et aL {Gene 67:155-164 (1987)). When the ligated fragments were 
transformed (the D-transformant in Table 7), the ligation mixture was not 
10 further purified but was used as such in the transformations. Prior to 
sporulating on potato dextrose agar (PD) slants 7*. . eesei transformants were 
transferred on the selective medium and purified through conidia^ 

6. Enzyme and protein measurements 

For the enzyme assays 7". reesei mycelium was separated from the 
15 culture medium by centrifuging for 15 to 30 min at 5.000 to 10,000 rpm 
(Sorvall SS-34. Dupont Company, Wilmington, Delaware, USA). A. niger 
cultures were centrifuged for 40 min at 10,000 rpm (Sorvall SS-34). The 
phytase activity was measured from the culture supernatant as the amount of 
inorganic phosphate released by enzymatic hydrolysis of sodium phytate 
20 substrate at 37°C as described earlier. One phytase normalized unit (PNU) 
is defined as the amount of phytase activity produced by the A, niger ALKO 
243 strain under the cultivation conditions used. 

The phytase production on the sodium phytate assay plates was 
visualized by pouring the reagent C (3:1:1 ratio of 1 M H : S0 4 , 2.5% 
25 ammonium molybdate. 10% ascorbic acid) on the plates and incubating them 
at 50°C for 15 minutes. The reduction of the phosphomolybdate complex leads 
to bluish color. 

Amyloglucosidase activity (AGU) was measured by using 1% 
Zulkowsky starch (Merck) as a substrate and measuring the amount of the 
30 released glucose units by boiling with DNS reagent (see below) after 10 min 
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of reaction at 60°C at pH 4.8. Proteases (HUT) were measured at pH 4.7 as 
in Food Chemicals Codex (1981) by using 2% haemoglobin (Sigma) as a 
substrate. Endoglucanase (ECU) and cellobiohydrolase (FPU) activities were 
measured as in IUPAC's Measurement of Cellulase Activities (IUPAC 
5 Commission on Biotechnology, Measurement of Cellulase Activities. 
Biochemical Engineering Research Centre. Indian Institute of Technology, 
Delhi, India, pp. 5-7 and 10-11 (1981)). 1% hydroxyethylcellulose (Fluka 
AG) in 50 mM sodium-citrate buffer (pH4.8) and Whatman no. 1 paper were 
used as substrates, respectively. DNS used differed from that described at the 
10 IUPAC's protocol and was made by first diluting 50.0 g 2-hydroxy-3.5- 
dinitrobenzoic acid (Merck) into 4 1 of deionized water. Then, 80.0 g NaOH 
was added slowly by using the magnetic stirrer and 1.500 g sodium-potassium 
tartrate (Merck) was added and diluted by heating the solution (maximum 
temperature 45 °C). The total volume was adjusted to 5 1, the solution was 
15 filtered through Whatman no. 1 and was protected from light. 

CBHI protein was measured from the culture supernatant ~by running 
a SDS-polyacrylamide gel and detecting the CBHI protein band (T. reesei 
ALKO 233 and ALKO 2221 strains) or by dot blotting samples using the 
Schleicher & SchuelPs (Dassel, FRG) Minifold™ Micro-Sample Filtration 
20 Manifold (T. reesei ATCC56765). CBHI protein was detected by using CBHI 
specific monoclonal antibody CI-89 (Aho, S.. etal., Eur. J. Biochem. 
200:643-649 (1991)) and anti-mouse-IgG alkaline phosphate conjugate 
(Promega). Visualization of the immunocomplexes was done as in the plaque 
screening. 
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7. SDS-PAGE and Western blot analysis 

Sodium dodecyl sulphate-poiyacryiamide gel electrophoresis (SDS- 
PAGE) and Western blot analysis were done as in Laernmli (LaemmlL U.K.. 
Nature 227:680-635 (1970)) and as in Towbin et al. (Towbin, H.. et aL. 
5 Proc. Natl. Acad. Sci. USA 76:4350-4354 ( 1979)). Visualization of the phytase 
protein in Western blots was done by using the polyclonal rabbit antiserum 
KH1236. Visualization of the immunocomplexes was done as in the plaque 
screening. 

8. Polymerase Chain Reaction (PCR) 

10 The PCR reactions were performed by Techne Thermal Cycler PHC-2 

(Techne Ltd., Cambridge, UK) in 100 ^1 volumes. The reaction mixtures 
contained 0.2 mM of each dNTP (3'-deoxynucloside-5 '-triphosphate. 
Pharmacia) in 10 mM Tris buffer (pH 8.3), 50 mM KC1, 1.5-2.0 mM MgCU 
and 0.01 % (w/v) gelatin. The protocol used was the following: 95°C (plasmid) 

15 or 100°C (chromosomal DNA)/5 min before adding the Taq DNA polymerase 
(1-2 units, Cetus Corp., Emeryville, CA. USA) and 100 fi\ overlay of paraffin 
oil, denaturation 95°C/1 min, annealing 60°C/1 min (in the inverse PCR 
52°C), extension 72°C/2 min (in the inverse PCR 2.5 min) for 30 cycles. The 
final extension time was 9 min to ensure completion of the strand synthesis. 

20 When a plasmid or DNA fragment was used as a template, the amount of the 
template used was 5-10 ng and 20-50 pmol of each primer was added. When 
chromosomal DNA was used as a template, the corresponding amounts used 
were iOO ng and 50-100 pmol. The circular template for the inverse PCR was 
done from the digested chromosomal DNA as in Innis et al. (Ochman, H., 

25 etaL. "Amplification of flanking sequences by inverse PCR." in PCR 
Protocols. A Guide to Methods and Applications. Innis, M.A., et al.. eds.. 
Academic Press. San Diego, pp. 219-227 (1990)). PCR fragments were 
purified by GeneClean 1 ® or Mermaid™ Kit (from an agarose gel if needed) or 
by Qiagen tips. The ends of the fragments were filled by using the DNA 

30 polymerase I Klenow fragment. 
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II. RESULTS 

A. Molecular cloning of the Aspergillus niger phytase 

1- Production of the phytase probe by 
nested PCR amplification 

The oligonucleotide primers used in the PCR reactions and the 
corresponding amino acid sequences of the phytase ALKO 243 peptides #79? 
(15 phy) and #420 (i 0 phy) (Table 3, as described earlier) are shown in 
Figure 3 (nucleotides 1409-1480 and 1727-1762 in the phytase sequence, see 
Figure 5). Two primary PCR reactions were done. In reaction A. sense 
oligonucleotide 1 (#792) and antisense oligonucleotide 4 (#420) were used; in 
reaction B, sense oligonucleotide 3 (#420) and antisense oligonucleotide 2 
(#792) were used. Primary PCR reaction A gave a single band of about 400 
bps while PCR amplification from the reaction B gave no product. Thus it was 
concluded that the region coding for the peptide #792 was located on the 5'- 
side in the phytase sequence compared to that coding for the peptide #420. 
The primary PCR fragment (A) was used as a template for the second PCR 
with internal primers from the peptide sequences: sense oligonucleotide 2 and 
antisense oligonucleotide 3. PCR amplification from the second PCR reaction 
gave a specific product of about 350 bps. The PCR fragment was cloned to 
Smal digested pUC9 and sequenced. The amino acid sequence deduced from 
the DNA sequence contained also the known amino acids from the peptide 
#792 and #420 that were not coded by the primer sequences. The 
approximately 350 bp PCR fragment was used to probe the DNA bank. 
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2. Screening of the DNA bank 

The gene bank contained approximately 1.9 x 10° pfu (plaque forming 
units)/ml. of which approximately 99.5% had an insert. From about 80.000 
plaques that were screened, two positive clones Hael-6 and Hael-5 were 

5 found. The clones were isolated, purified and the inserts (5.6 and 5.2 kb) were 
subcloned to £coRI cut pALK307. The clones also reacted with the phytase 
antiserum KH1236. The inserts were restriction mapped and the PCR fragment 
was found to hybridize to the about 1 kb BamHl-Sphl restriction fragment of 
the clones (see Figure 4 for the hybridization area in the phytase sequence). 

10 The sequence of the clone Hae2-6 that contained more of the 5 '-sequence 
coded for 15 internal tryptic peptide sequences but the N-terminal amino acid 
sequence was not found. The N-terminal and the promoter area containing 
phage clones were screened from the gene bank by using a 5'-ptobe made by 
inverse PCR (Ochman, H., era/., "Amplification of flanking sequences by 

1 5 inverse PCR, " in PCR Protocols, A Guide to Methods and Applications, Innis, 
M.A.. et aL, eds., Academic Press, San Diego, pp. 219-227 (1990)). 

3. Amplification of the 5 '-end and the 
promoter sequence of the phytase gene 
by inverse PCR 

20 Restriction enzyme digestions of the genomic DNA and Southern 

hybridizations with the 350 bp PCR probe showed that digestions of the 
genomic DNA with Sail produced fragments of suitable size (1-3 kb) for 
circularization and amplification with PCR. The primers used for inverse 
PCR were from bases 1243-1257 (antisense primer) and 1304-1321 (sense 

25 primer) areas of the phytase sequence (see Figure 5). Inverse PCR with Sail 
digested ALKO 243 DNA created a PCR band of about 1.2 kb. The 350 bp 
PCR fragment hybridized to the inverse PCR fragment and by sequencing the 
subcloned fragment it was confirmed that it contained the upstream parts of the 
phytase gene and also the N-terminal peptide sequence was included. 
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4. Isolation of the compiete phytase gene 
The 1.2 kb PCR fragment obtained from the inverse PCR was used as 
a probe to screen 80.000 plaques from which seven positive plaques could be 
identified. The complete phytase gene was isolated on an about 6 kb insert of 
5 a phage clone. 

The about 2.4 kb Sphl fragment containing the phytase gene and the 
promoter area was subcloned into pALK307 (cut with Sphl) to give pALK169 
(Figure 4). The restriction map of the phytase containing Sphl fragment and 
the location of the phytase gene in the fragment are shown in the plasmid map. 

10 The phytase sequence is shown in Figure 5. The sequence coding for 

the phytase protein corresponds to the phytase sequence of Aspergillus ficuum 
published in the Gist brocades (Delft, Netherlands) PCT patent application 
(EP420,358 or WO91/05053) with 12 differences in thetleduced amino acids. 
Each difference in the deduced amino acid was due to one nucleotide's change 

15 and might be due to differences between the strains. Also, in the sequence 
coding for the structural gene, 33 nucleotide differences were found that did 
not lead to differences in the deduced amino acid sequence. In the signal 
sequence, there were differences in two nucleotides (the other lead to a 
difference in the deduced amino acid) and in the proposed intron area 8 

20 differences could be found. The overall match per length (nucleotide sequences 
from the first ATG to the STOP codon TAG) between the two sequences was 
96.3%. The differences found between the two phytase sequences. Gist- 
brocades' and Alko's. are shown in the Table 5. 
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Table 5. Nucleotide and amino acid differences between Gist's and Alko's phvtase j 



sequences. 
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Table 5. Nucleotide and amino acid differences between Gist's and Alko's phytase 
sequences. The nucleotide numbers are counted from the first Met (ATG) in the signal 
5 sequence and amino acid numbers from the N -terminal Leu (see Figure 5). The amino acids, 
if different between the two sequences, are written by bold letters. 

B. Construction of the plasmids for overexpression 
of phytase in Trichoderma reesei 



L The PCR fragments for the precise cbhl 
10 - phytase fusions 

The fusions between the cbhl promoter and the phytase signal 
sequence, and between the cbhl signal sequence and the phytase gene were 
done by PCR. The phytase signal sequence or the phytase protein N-terminal 
sequence start precisely where the corresponding cbhl sequences would start 

15 (for the cbhl sequence, see Shoemaker, S., et al.. Bio/Technology 7:691-696 
(1983)). The primers used for the PCR fragments are shown in Figure 6. 
To construct pALK171 (phytase signal sequence), we made use of the Sacll 
site in the cbhl promoter area in the 5'-PCR primer. The Xho\ site (14 
nucleotides from the N-terminal of the phytase gene) was used in the 3*- 

20 primer. The 5 '-primer was a 39-mer that contained a "tail" of 19 nucleotides 
of the cbhl promoter sequence (preceding the signal sequence) and 20 
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nucleotides from the phytase signal sequence. The 3'-primer was a 22-mer. 
In the construction of pALK172 {cbhl signal sequence), we made use of the 
Sfil site in the cbhl signal sequence in the 5'-primer and Sail site of the 
phytase (762 nucleotides from the N-terminal) in the 3'-primer. In this case, 
the 5 -primer was a 46-mer containing a "tail" of 28 nucleotides and 18 
nucleotides of the phytase N-terminal sequence; the 3'-primer was a 24-mer. 
In all the primers, three to five extra nucleotides were added to the ends of the 
PCR fragments after the restriction site sequences to ensure a correct cut. 
pALK169 was used as a template in the PCR reactions. 

Fragments of the expected lengths were obtained from the PCR 
reactions: fragment containing the wanted fusion was 202 bps for pALK171 
and 800 bps for pALK172. 

2. Construction of plasmids with the cbhl 
promoter: pALK171 (phytase signal 
sequence) and pALK172 (cbhl signal 
sequence) 

The plasmid pALK170L was made by cutting the phytase gene from 
pALK169 as an Sphl-Xhol fragment and ligating it to Sph\-Xho\ cut pALK307. 
The 202 bp PCR fragment containing the cbh\ promoter and phytase signal 
sequence was cut with Xhol and ligated to pALK170L that had been cut with 
XbaU treated with the DNA polymerase I Klenow fragment and cut with Xhol 
(pALK170X. Figure 7). The fusion and the PCR fragment areas were 
sequenced to ensure that no mistakes had occurred in the PCR amplification. 
Phytase fragment containing the fusion was isolated as an Sph\ (treated with 
the T4 DNA polymerase)- Sacll fragment and was inserted between the cbhl 
promoter and terminator areas . of the plasmid pAMHUO previously digested 
with Nde\ (filled in with the DNA polymerase I Klenow fragment) and SacW. 
The plasmid obtained (pALK171X. Figure 7) contains the phytase gene 
precisely fused to the cbhl promoter. To construct pALK171, the amdS gene 
(selectable marker) was isolated from p3SR2 as a Sphl-Kpnl fragment, the 
ends were treated by the T4 DNA polymerase, and amdS was then ligated to 
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the Sphl site of pALK171X (treated with the T4 DNA polymerase). The 
approximately 7.5 kb linear fragment that contained no bacterial sequences 
was isolated from pALK171 by cutting with Xbal and was used for the 
transformations. 

5 pALK172 was constructed essentially the same way as pALK171 (see 

Figure 8). The 800 bp PCR fragment was cut with Sail and ligated to Xbal 
(filled in with the DNA polymerase I Klenow fragment), San cut pALK170L. 
Also in this case, the fusions and the sequence of the PCR fragment were 
checked by sequencing. Phytase-PCR fragment fusion was isolated from 

10 pALKHOS as an Sph\ (treated with the T4 DNA polymerase)-^ fragment 
and ligated to pAMHl 10 that had been cut with Ndel (filled in with the DNA 
polymerase I Klenow fragment)-.^. To construct pALK172, the fragment 
containing the amdS gene was ligated to pALK172S in the same way as when 
constructing pALK171. Xbal was used also in this case to cut out from the 

15 vector backbone the linear fragment that was used in the transformations. 

3. Construction of the plasmids with the phytase 
promoter: pALK173A and pALK173B 

The phytase gene with its own promoter was isolated as an Sph\ 
fragment from pALK169 and ligated into the Kpnl site of p3SR2 (in both cases 

20 the ends of the fragments were filled in by the T4-polymerase) resulting in 
about 11.2 kb plasmids pALK173A and pALK173B (see Figure 4 for 
PALK169 and Figures 7 and 8 for p3SR2 map). In pALK173A the two genes, 
phytase and amdS are in a parallel orientation, in pALK173B, they are in 
opposite orientations (Figure 9). £coRI was used to linearize pALK173A and 

25 pALK173B when linearized plasmids were used for transformations. 



WO 94/03612 



PCT/FI93/00310 



- 65 - 

C. Transformation of Trichoderma reesei and screening of ihe 
transformants 

T. reesei ATCC56765. ALKO 233 and ALKO 2221 strains were 
transformed with circular plasmids and with the Xbal fragments from the 

5 plasmids pALK171 and pALK172 (cbh\ promoter). T. reesei ALKO 233 and 
2221 strains were also transformed with the linearized pALK171 and 
pALK172 plasmids as well as with circular and linear pALK173A and 
pALK173B (phytase promoter) plasmids. Transformation frequencies 
(transformants/^g of DNA) varied from 3 to 60 when the fragments isolated 

10 from pALK171 or pALK172 were used. When pALK171 or pALK172 
circular plasmids were used in transformations, the frequencies were about 
50/ fig for T. reesei ALKO 233 and ALKO 2221 and aboutJ00/Mg for 
T. reesei ATCC56765. Transformation frequencies obtained when linearized 
plasmids were used were about 100//xg. When pALK173A or pALK 173B were 

15 used in transformations the frequencies were from 6 to 26 for the linear 
plasmid and from 6 to 20 for the circular plasmid. 

Regeneration frequency of the sphaeroplasts varied from 4.5% to 
13.2% for T. reesei ALKO 233 and ALKO 2221 strains and was 1-2% for the 
T. reesei ATCC56765 strain. 



WO 94/03612 



PCT/F193/00310 



\ 

rt 

4 



- 66 - 

The amount of the transformants that were screened for the phytase 
production on plates are shown in Table 6. Those clones that clearly produced 
a blue colored halo around the colony were counted as positive clones. 



5 


Table 6. Plate assay positive transformants and total number of tested 
clones 




Plasmid 


ALKO 233 


ALKO 2221 


ATCC56765 


10 


pALK171 

fragment 
circular plasmid 
linear plasmid 


49% (47/96) 
35% (6/17) 
.41% (29/71) 


46% (33/71) 
23% (5/22) 
49% (27/55) 


32% (22/68) 
ND 




pALK172 

fragment 
circular plasmid 
linear plasmid 


47% (48/103) 
17% (4/23) 
37% (22/59) 


30% (34/113) 
13% (2/15) 
24% (1 1/45) 


1 1 % (8/72) 
12% (12/104) 
ND 


15 


pALK173A 

circular plasmid 
linear plasmid 


75% (9/12) 
67% (10/15) 


70% (14/20) 
64% (14/22) 


ND 
ND 


20 


pALK173B 

circular plasmid 
linear plasmid 


40% (4/10) 
63% (10/16) 


63% (15/24) 
67% (8/12) 


ND 
ND 



Table 6. Plate assay positive transformants and total number of tested clones. 
The number of the plate assay positive transformants and the total number of phytase 
plate assay tested transformants are shown. Only those transformant strains that grew 
well both on the selection slant and on the plate assay are included. As positive 
phytase producers are counted those strains that clearly showed phvtase activity on the 
plate assay. 



The transformants that seemed to be the best producers on the plate 
assay were grown on shake flasks. Inocula were taken either directly from 
acetamide slants or from PD slants after purification through conidia. Of the 
T. reesei ATCC56765, ALKO 233 and ALKO 2221. transformed with the 
fragments from pALKI71 and..pALK172. from 7 to 16 clones from each set 
of transformants were purified and the phytase production was tested in shake 
flask cultivations. When circular plasmids pALK171 or pALK172 had been 
used in transformations. 13 and 12 purified T. reesei ATCC56765. and from 
four to seven T. reesei ALKO 233 and ALKO 2221 transformant strains were 
grown, respectively. When linearized pALKlTl or pALK172 plasmids were 



J 
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used, about 20 transformam strains from each were grown in shake flasks. Of 
the T. reesei aLKO 233 clones transformed with the linear pALK173A/B 
plasmids. seven pALK173A and two pALK173B (and with the circular 
plasmids four pALK173A and three pALK173B) transformants indicating 
phytase activity on plates were purified and tested in shake flask cultivations. 
For the T. reesei ALKO 2221 transformants the corresponding amounts tested 
in shake flask cultivations were as follows: for the linear plasmids. three 
pALK173A and five pALK173B and for the circular plasmids six pALK173A 
and six pALK173B transformants. 

D. Phytase production by the Trichoderma transformants 
The best phytase production levels from transformants of T. reesei 
ATCC56765. ALKO 233 and ALKO 2221. without E. coli sequences, when 
the cbh\ promoter (pALK171 and pALK172 fragments) was used are shown 
in Table 7. 
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About 3600 PNU/ml was obtained with the best transformant. About 
the same level of production could be achieved by using both the strains 
T. reesei ALKO 233 and ALKO 2221. The best phytase producing T. reesei 
ATCC56765 transformant produced about 1.800 PNU/ml. Both the phytase 

5 and the cbh\ signal sequence seemed to work equally well and the same levels 

in phytase production could be achieved when using T. reesei ALKO 2221 or 
ATCC56765 as a host strain. In T. reesei ALKO 233 the level of phytase 
activity produced was higher when the phytase signal sequence was used. 

Some of the transformams did not produce any detectable CBHI protein 

10 which most probably indicates integration of the transforming DNA to the 

cbh\ locus. The absence of the CBHI protein did not affect the production 
levels in the screened transformams, i.e., good producers were found both 
among the transformams producing normal amounts of CBHI as well as among 
CBHI negative strains (ALKO 233, ALKO 2221). 

15 The best phytase production levels obtained by the use of pALK171 

and pALK172 circular and linear plasmid are shown in the Table 8. The best 
production yields were obtained with the T. reesei ALKO 2221 that had been 
transformed with the linear plasmid pALK171 (phytase signal sequence). 
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Table 8. Phytase production by the T. reesei strains transformed with 
circular or linear plasmid pALK171 or pALK172 



Strain 


Plasmid 


Plasmid form 


Transformant 


PNU/ml 


ALKO 233 


pALKI71 ' 


circular 


C13 


650 








C23 


530 








C4 


240 






linear 


A22 


1,610 








A2I 


1.270 








A 1 7 






pALK172 


circular 


C13 


1.360 








C21 


540 








C 1 








linear 


A20 


1.420 








A27 


1.330 








r\ j — 


"OU 


ALKO 2221 


pALK171 


circular 


C2 


^ 1,630 








C8 


1.290 








C32 


810 






linear 


A14 


3.800 








A8 


3,660 








Din 


.3,0 1U 




pALK172 


circular 


C3 


480 








C4 


190 










170 






linear 


B18 


2.060 








A36 


1,790 








B9 


1.390 


ATCC56765 


pALK171 


circular 


A74 


2.030 








A75 


1.980 








Bll 


1,870 




pALK172 


circular 


B3 


2.250 








B23 


1.970 








Bl 


1,030 



Table 8. Phytase production by the T. reesei strains transformed widi circular or 
linear plasmids pALK171 and pALK172. Phytase activities as PNU/ml in the culture 
supernatants of the three best phytase producing transformants of each type are shown. T. 
reesei ATCC56765 transformants were purified through conidia before cultivations and the 
results are averages from rwo shake flask cultivations. Inocula for cultivations of the 7*. reesei 
ALKO 233 and ALKO 2221 transformants were taken from the acetamide slants and the 
results shown are from one shake flask cultivation. 
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Also the A. niger phytase promoter can promote the expression of the 
gene in Trichoderma. However, the enzyme yields obtained are much lower 
than with the cbh\ promoter homologous to 7". reesei: the activities obtained 
from the culture supernatants of transformants containing the phytase' s own 
promoter were from about 1 to about 14 PNU/ml for the 7*. reesei ALKO 233 
transformants and from about 6 to about 120 PNU/ml for the T. reesei ALKO 
2221 transformants (Table 9). 



Table 9. Phytase production by the T. reesei strains transformed with 
circular or linear plasm id pAJLK173A or pAJLK173B 



Strain 


Plasmid 


Plasmid form 


Transformant 


PNU/ml 


AJLivU J. j j 


_ a I V 1 A 
pALN. 1 / jA 


circular 














0.0 








C23 








linear 




o. o 








JJlo 


D . O 








D6 


3.9 




pALK173B 


circular 


C8 


3.5 








C6 


< 








E10 


< 






linear 


D13 


5.5 








D31 


< 


ALKO 2221 


pALK173A 


circular 


E3 


32.5 






E5 


31.7 








C22 


25.0 






linear 


D24 


37.5 








D36 


8.3 








D17 


5.8 




pALK173B 


circular 


C13 


115.8 








C28 


65.0 








C27 


50.8 






linear 


D9 


36.7 








D26 


22.5 








D21 


18.3 



Table 9. Phytase production by T. reesei iransfomiants transformed with circular or 
linear plasmid pALK173A or pALK173B (phytase promoter). Phytase activities as PNU/ml 
in die culture supernatants or* the transformants are shown. Transformants have been purified 
through comdia before cultivation. The results shown are from one shake flask cultivation. 
A "less thair sign [ <) means that die value was below the detection level. 
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E. The enzyme background in the phycase preparations produced 
by T. reesei 

Phytase is expressed in the T. reesei strains in high amounts and the 
background of other enzyme activities in the supematants of T. reesei 
transformants is different from those in the Aspergillus supernatant (Table 7). 
Both endoglucanase and cellobiohydrolase activities are substantially higher 
when T. reesei is used as a production host compared to A. niger. The T. 
reesei strains used also produced proportionally lessglucoamylase activity than 
the A. niger ALKO 243 strain. 

F. Phytase protein produced by the Trichoderma transformants 
Samples from the growth media of the transformants (pALK171 

fragment) and the nontransformed T. reesei strains ATCC56765, ALKO 233 
and ALKO 2221 were analyzed in Western blots (Figure 10). Briefly, the 
following samples were analyzed: Lane 1 : 50 ng of purified Aspergillus ALKO 
243 phytase: Lane 2: 15 ng of endoF-treated Aspergillus ALKO 243 phytase; 
Lanes 3 and 10: T. reesei ALKO 233; Lanes 4-5 and 11-12: T. reesei ALKO 
233 transformant 171FR/ A4 and A13, respectively; Lanes 6 and 13: T. 
reesei ALKO 2221; Lanes 7-8 and 14-15: T. reesei ALKO 2221 transformant 
171FR/A5 and A9, respectively: Lane 9: T. reesei ALKO 2221 transformant 
D2: Lane 16: T. reesei ATCC56765: Lanes 17, 18. 19: T. reesei ATCC56765 
transformants 171FR/A21, All. and A23, respectively. In each case. 2 ^1 of 
1:10 dilution of the culture supernatant were run on the gel. 171 FR is the 
host transformed with the Xbal fragment from the plasmid pALK171. 

The molecular weight of the phytase produced by Trichoderma differed 
from that produced by Aspergillus and the difference seemed to be due to 
differences in the glycosylation level. The phytase secreted by T. reesei ALKO 
233 was visible in the Western blots as three and that secreted by T. reesei 
ALKO 2221 as 6 to 9 major protein bands of sizes of about 45-65 kDal. the 
lowest of which corresponded in size the deglycosylated Aspergillus phytase 
(45-48 kDal in SDS-PAGE). The phytase secreted by T. reesei ATCC56765 



was of a size of 65-80 kDal and consisted of three to rive protein bands. The 
molecular weight of the native Aspergillus phytase run in SDS-PAGE is about 
80-85 kDal. The phytase protein produced by the transformants that had been 
transformed with pALK172 fragment or pALK173A/B showed the same kind 
of banding pattern. 

III. Conclusions 

The production level of Aspergillus phytase obtained when T. reesei 
was used as a production. host was surprisingly high. By using 7". reesei. the 
phytase is produced in a novel background differing from that of Aspergillus 
and containing enzymes important eg. in feed applications. The molecular 
weight of the Aspergillus phytase protein produced in Trichoderma is different 
from that of Aspergillus. This difference in size seemed to be dueYo different 
glycosylation but did not affect the enzyme activity. 

EXAMPLE 7 

Production of Aspergillus niger pH 2.5 Acid 
Phosphatase in Trichoderma reesei 

I . EXPERIMENTAL PROTOCOLS 

1. Strains and plasmids 

E. coli strains XL1 -Blue (Bullock, W.O. ei aL. Biotechniques 5:376- 
379 (1987); Stratagene. La Jolla. CA, USA) and Sure™ (Greener. A.. 
Strategies J:5-6 (1990); Stratagene, La Jolla. CA, USA) were used as hosts 
for constructions made of plasmids pALK601 (Figure 12 and 13) and pAP-1 
(Figure 12 and 13). The plasmid pALK601 contains the T. reesei cbh\ 
promoter and terminator sequences and the Aspergillus nidulans acetamidase 
gene. The plasmid pAP-1 contains pH 2.5 acid phosphatase gene from 
Aspergillus niger ■ var. awamori ALKO 243 (ATCC 38854). 
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Trichoderma reesei strain ALKO 2221. a low aspanyl protease mutant 
derived from the T. reesei strain ALKO 233 (VTT-D-79125) by UV- 
mutagenesis was used as a recipient tor the pH 2.5 acid phosphatase gene. 

2. Growth media and culture conditions 

£. coli strains were grown in L-broth (Maniatis, T., et al.. Molecular 
Cloning, A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, USA (1982)) supplemented with ampicillin (50 jig/ml) when 
needed. E. coli cultivations were carried out at 37 °C overnight. 

PD agar slants (Potato Dextrose broth by Difco, Detroit, Michigan, 
USA) were used for storing the Trichoderma strains. The plates and media 
for T. reesei transformations were essentially as in Penttila et al. (Penttila, M. ? 
et aL, Gene 61: 155-164 (1987)). The transformants were purified on selective 
acetamide-CsCl medium (Penttila, M. et aL. Gene 67:155-164 (1987) before 
transferring to PD slants. T. reesei transformants were grown in lactose based 
medium (see example 6 subparagraph 2) at 30°C (250 rpm) for 7 days for 
expression of pH 2.5 acid phosphatase under the control of the cbh \ promoter. 

3. Manipulation of DNA 

Manipulations of DNA were performed as described above for phytase. 
mainly by standard methods (Maniatis, T.. et aL. Molecular Cloning, A 
20 Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor. 

NY, USA (1982)). Plasmid DNA from E. coli was isolated by using Qiagen 
columns (Diagen GmbH. Dusseldorf. FRG) according to the supplier's 
instructions. For rapid screening of the plasmid DNA from E. coli, the 
method of Holmes and Quigley, (Holmes and Quigley, Anal. Biochem. 
25 7/4:103-197 (1981)) was used. The restriction enzymes, T4 DNA ligase. 

Klenow fragment of DNA polymerase I and T4 DNA polymerase used in the 
DNA manipulations were from Boehringer (Mannheim, FRG) and New 
England Biolabs (Beverly. MA. USA). Each enzyme was used according to 
the supplier's recommendation. DNA fragments for cloning or 
30 transformations were isolated from low melting point agarose gels (FMC 



10 



Bioproducts. Rockland. ME. USA) by the freeze thaw phenol method 
(Benson. S.A. Bio/Techniques 2:66-68 (1984)) or by using the Mermaid™ Kit 
(BIO 101 Inc., La Jolla. CA. USA) according to the supplier's instructions. 

Sequencing of the fusions between the cbh\ promoter and pH 2.5 acid 
phosphatase gene was carried out by means of pUC/M13 primers and 
extension primers using Taq DyeDeoxy™ Terminator Cycle Sequencing Kit 
(Applied Biosystems) and the automated sequencer (Applied Biosystems 373A. 
Foster City, CA, USA). 

The oligonucleotides used were synthesized by an Applied Biosystems 
(Foster City, CA. USA) 381 A Synthesizer except the M13 primers that were 
purchased from the Applied Biosystems. 

4. Transformations 

Transformations were also performed as described above for phytase. 
Transformation of £. coli strains XL 1 -Blue or Sure was performed by the 
supplier's method (Stratagene La Jolla, CA, USA), T. reesei strains were 
transformed essentially according to the method of Penttila et al. (Penttila, M. , 
et aL, Gene 67:155-164 (1987)). Novozym 234 used in fungal protoplast 
preparation for transformations was from Novo Industri AS (Copenhagen. 
Denmark). Prior sporulating on PD slants T. reesei transformants were 
purified through conidia on the selective acetamide medium. 

5. Enzyme activity assays 

For the enzyme assays the mycelium was separated from the culture 
medium by centrifuging for 15 min at 3.000 rpm (Sorvall SS-34. Dupont 
Company, Wilmington. Delaware, USA). The pH 2.5 acid phosphatase 
enzyme activity was measured from the culture supernatant using 
paranitrophenylphosphate (Sigma. St. Louis, USA) as a substrate as described 
earlier. One pH 2.5 acid phosphatase activity unit releases 1 nmol of inorganic 
phosphate per minute on the substrate p-nitrophenylphosphate in pH 2.5 at 
37°C. One acid phosphatase normalized unit (APNU) is defined as the 
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amount of acid phosphatase activity produced by the A. niger ALKO 243 
strain under the cultivation conditions used (see example 6. subparagraph 2). 

Amyloglucosidase activity (AGU) was measured bv using 1% 
Zulkowskv starch (Merck) as a substrate and measuring the amount of the 
released glucose units by boiling with DNS reagent (see below, after 10 min 
of reaction at 60°C at pH 4.8. Proteases (HUT) were measured at P H 4.7 as 
in Food Chemicals Codex (Food Chemicals Codex, National Academy Press 
Washington, DC, USA, pp. 496-497 (1981)) by using 2% haemoglobin 
(Sigma, as a substrate. -Endoglucanase (ECU) and cellobiohydrolase (FPU) 
activities were measured as in IUPAC's Measurement of cellulase activities 
(IUPAC Commission on Biotechnology, Measurement of Cellulase Activities 
Biochemical Engineering Research Centre, Indian Institute of Technology 
Delhi, India, pp. 5-7 and 10-11 (1981)). 1% hydroxyethylcellulose (Fluka 
AG) in 50 mM Na-citrate buffer (pH 4.8) and Whatman no. 1 paper were 
used as substrates, respectively. DNS used differed from that described at the 
lUPAC's Measurement of cellulase activities (IUPAC Commission on 
Biotechnology, Measurement of Cellulase Activities, Biochemical Engineering 
Research Centre, Indian Institute of Technology, Delhi, India, pp. 5-7 and 10^ 
11 (1984)) and was made by first diluting 50.0g 2-hydroxy-3,5- 
dinitrobentsoicacid (Merck) into 4 liters of deionized water. Then 80.0g 
NaOH was added slowly by using the magnetic stirrer and 1.500g K-Na- 
tartrate (Merck) was added and diluted by heating the solution Jaximum 
temperature 45°C). The total volume was adjusted to 5 liters, the solution 
was filtered through Whatman no. i and was protected from light. 
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6. SDS-page and Western blot analysis 

Sodium dodecyl sulphate poiyacryiarnide gel electrophoresis (SDS- 
page) and Western blot analysis were done according to the methods ot 
Laemmli (Laemmli. U.K., Nature 227:680-685 (1970)) and Towbin et at. 
5 (Towbin. H., et al.< Proc. Natl. Acad. Sci. USA 76:4350-4354 (1979)). 

Visualization of the pH 2.5 acid phosphatase protein in Western blots was 
done by using the polyclonal rabbit antiserum KH1269. KH1269 was made 
against purified deglycosylated pH 2.5 acid phosphatase protein (M. Turunen, 
Alko Ltd.) and it was .supplied by the National Public Health Institute 

10 (Helsinki, Finland). Visualization of the CBHI protein from the pH 2.5 acid 

phosphatase transformants in Western blots was done by using the mouse 
monoclonal antibody CI-261 (Aho ? S. et aL. Eur. J. Biochem 200:643-649 
(1991)). Anti-rabbit-IgG and anti-mouse-IgG alkaline phosphate conjugate and 
color development substrates from ProtoBlot™ lmmunoblotting system 

15 (Promega, Madison, USA) were used to detect the immunocomplexes. 



7. PCR 

The PCR reactions were performed by a Techne thermal cycler PHC-2 
(Techne Ltd., Cambridge, UK) in 100 /xl volumes. The reaction mixture 
contained 0.2 mM of each dNTP (Pharmacia pH 8.3), 20 - 50 pmol of each 

20 primer and 10 ng of plasmid template in 10 mM Tris buffer (pH 8.3), 50 mM 

KC1, 1.5 mM MgCK and 100 ixg/ml gelatin. The protocol used was the 
following: 96°C/ 10 min before adding the Taq DNA polymerase (2 units. 
Boehringer Mannheim, FRG) and 100 /xl of paraffin oil. denaturation 95°C / 
1 min, annealing 60°C /l min, extension 72°C/ 1 min for 30 cycles. The 

25 final extension time was 9 min to ensure completion of the strand synthesis. 

The PCR fragments were purified by Mermaid™ Kit. The ends of the 
fragments were filled by using the DNA polymerase 1 Klenow fragment. 
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II. RESULTS 

A. Vector constructions for overexpression of the pH 2.5 acid 
phosphatase gene in Trichoderma reesei ALKO 2221 

1. Construction pf plasmid pALK533 
The plasmid pALK533 consists of Aspergillus niger var. awamori 
ALKO 243 pH2.5 acid phosphatase gene with its own signal sequence inserted 
into the Trichoderma reesei expression casette containing the cbhl promoter 
and terminator sequences. pALK533 also contains the Aspergillus nidulans 
amdS gene as a selection marker for transformants. 

The precise fusion between the cbhl promoter and the pH 2.5 acid 
phosphatase signal sequence was done with PCR. The^rimers used for the 
PCR fragments are shown in Figure 11. The Sacll site in the cbhl promoter 
area was used in the 5 '-primer and the Mlu\ site of the acid phosphatase gene 
(374 nucleotides down from the N-terminal of the acid phosphatase gene) was 
used in the 3 '-primer. The 5 '-primer was a 39-mer containing a tail of 19 
nucleotides of the cbhl promoter sequence preceding the signal sequence 
joining exactly to the first 20 nucleotides of the acid phosphatase signal 
sequence. The 3 '-primer was a 30-mer of pH 2.5 acid phosphatase gene. 
pAP-1 (Figure 12) was used as a template in the PCR reactions. A fragment 
of the expected length of 466 bps was obtained from the PCR reaction. 

The 466 bp PCR fragment containing the pH 2.5 acid phosphatase 
signal sequence was digested with Mlul and ligated to pAP-1 that had been 
digested with M/zdlll, treated with DNA polymerase I Klenow fragment and 
digested with Mlul to obtain plasmid pi 02 (Figure 15). The fusion and the 
PCR fragment were sequenced to ensure that no mistakes had occurred in the 
PCR amplification. 

To construct plasmid pALK533 (Figure 12). a pH 2.5 acid phosphatase 
gene containing the fusion was isolated from the plasmid pl02 as an Sph\ 
(filled in with DNA polymerase I Klenow fragment) - Sacll fragment and 
inserted between the cbhl promoter and terminator of the plasmid 
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pALK601(AAtfeI) that had been digested with Ndel (filled in with DNA 
polymerase I Klenow fragment) andSacII. In pALK601(AAW<?lh the Ndel site 
in the intron area of the amdS gene in pALK601 is inactivated using DNA 
polymerase I Klenow fragment. The linear fragment used for transformations 
was digested out from the vector backbone with Xbal. 

2. Construction of the plasmid pALK532 
The plasmid pALK532 consists of the Aspergillus niger var. awamori 
ALKO 243 pH 2.5 acid phosphatase gene inserted into the Trichoderma reesei 
CBHI expression casette containing the cbh\ promoter and signal sequence and 
terminator sequences. pALK532 also contains the Aspergillus nidulans amdS 
gene as a selection marker for transformants. 

The precise fusion between the cbh\ signal sequence and "the pH 2.5 
acid phosphatase gene was done with PCR. The primers used for PGR 
fragments are shown in Figure 11. The Sfil site in the cbh\ signal sequence 
was used in the 5 '-primer. The 5 '-primer was a 46-mer containing a tail of 
28 nucleotides joining exactly to the first 18 nucleotides of the acid 
phosphatase N-terminal sequence. The 3 '-primer was the same 30-mer used 
in the construction of pALK533. pAP-1 was used as a template in the PCR 
reaction. A fragment of the expected length of 418 bps was obtained from the 
PCR reaction. 

The 418 bp PCR fragment containing the cbh\ signal sequence was 
digested with Mlu\ and ligated to pAP-1 that had been digested with Hin&lU. 
treated with DNA polymerase I Klenow fragment and digested with Mlul to 
obtain plasmid p51 (Figure 13). The fusion and the PCR fragment were 
sequenced to ensure that no mistakes had occurred in the PCR amplification. 
To construct the plasmid pALK532 (Figure 13). a pH 2.5 acid phosphatase 
fragment containing the fusion was isolated from the plasmid p51 as a Sphl 
(tilted in with DNA polymerase I Klenow fragment) - Sfil fragment and was 
inserted between the cbh\ promoter and terminator areas of the plasmid 
pALK601(Aj\tfd). pALK601(AAWeI) had been digested with Ndel and filled 
in with the DNA polymerase I Klenow fragment and digested with Sfil. The 
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approximately 7.8 kb linear fragment that contained no bacterial sequences 
was isolated from pALK532 by restricting with Xbal and was used for 
transformations. 



B. Transformation of Trichoderma reesei and screening of the 
transformants ° 

Trichoderma reesei ALKO 2221 was transformed separately with the 
linear Xbal fragments from the plasmid pALK532 and pALK533. 
Transformation frequencies (transformants / M g of DNA) varied from 2 to 30. 

Forty-four T. reesei ALKO 2221/pALK532 transformants and 103 
PALK533 transformants were purified through conidia and were cultivated in 
shake flasks. 



C. pH 2.5 acid phosphatase production by the Trichoderma 
transformants 

The best transformants based on the pH 2.5 acid phosphatase 
production are shown in the Table 10. The best enzyme activity level was 240 
APNU in shake flask cultivation in lactose based medium. 



WO 94/03612 



PCT/FI93/00310 



-81- 



3 
< 



30 



c; O O O 

OQ C\ vO Cs 

m -rr t 



o ^ o o 
on =: r-j := 
^ 4— in ^> 



f^l f*J vC 

m — io — 



a ^ — c 



to 



mi — t r- 

on m m 



Q m C Q c C 

z ^ z: z ^ z 



3 
< 



T+- + 4- + + ^ 



ri r*i t^i r*i — — — — 



r""> m m rn rn m m rn 
iy~i t/n «0 m wo i/"^ 

<<<<<<<<<< 



CO 



■a 



=3 ^ 
= < 



= — 



^ — t ri n ^ 
_ ^. „^ ^ C2 ✓ ^ (J 













5 




-IT 


re 


— , 


— 


C- 


















l 












re 








< 


r*" J 


















"> 


'V: 






u 




re 










.o 


no 


to 






CO 


1 






CJ 




CC 


















o 


3 






1 


> 



5 

o 



on C 
re — 



co re 22 



+ 



c ^ 
re — = 

fa i. 



re 



2! 5 



< — 



: — re 



SUBSTBTUTE SHEET 



WO 94/03612 



PCI7FI93/00310 



- 82 - 

Both the acid phosphatase and the cbh\ signal sequence worked equally 
well and about the same level of pH 2.5 acid phosphatase activity could be 
achieved. The best pH 2.5 acid phosphatase activity level was produced by 
Trichoderma transformant SC-9 and was about 250 fold greater than the levels 
produced by native Aspergillus niger var. awamori ALKO 243 strain in 
corresponding conditions. 

Two out of the nine best producers did not react with the monoclonal 
CBHI antibody in Western blot analysis suggesting that the expression casette 
had integrated. to the cbhJ locus in those two transformants (Table 10). 

D. The enzyme background in the pH 2.5 acid phosphatase 
preparations produced by T. reesei 

The pH 2.5 acid phosphatase is expressed in the T. reesei transformants 
in high amounts and the background of some other enzyme activities in the 
supernatants of T. reesei transformants is different from those in the 
Aspergillus supernatant (Table 10). Both endoglucanase and cellobiohydrolase 
activities are significantly higher when T. reesei is used as a production host. 
The T. reesei transformants also produced proportionally less glucoamyiase 
activity than the A. niger ALKO 243 strain. 

E. Identification of the pH 2.5 acid phosphatase produced by the 
Trichoderma transformants 

Samples from the growth media of the transformants and the T. reesei 
ALKO 2221 strain were analyzed in Western blot (Figure 14). The following 
samples were analyzed: lOng of purified Aspergillus ALKO 243 pH 2.5 acid 
phosphatase: lOng of endoF treated Aspergillus ALKO 243 pH 2.5 acid 
phosphatase: and 60ng of protein from the each of the culture supernatants of 
Trichoderma reesei ALKO 2221 transformants SC-9. KA-31. KA-17. KB-44. 
KB-18. SB-4 and KA-28 (Figure 14). 

The pH 2.5 acid phosphatase secreted by T. reesei transformants was 
seen as four protein bands of sizes of about 50 - 66 kD. This is probably due 
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to the different level of glycolysation of the protein pan of the secreted pH 2.5 
acid phosphatase. Compared to the size of the pH 2.5 acid phosphatase 
produced by Aspergillus niger var. awamori ALKO 243 strain (66 kD) a 
majority of the pH 2.5 acid phosphatase proteins produced by T. reesei are 
smaller than that produced by Aspergillus. 

All references are incorporated herein by reference. Having now fully 
described the invention, it will be understood by those with skill in the art that 
the scope may. be performed with a wide and equivalent range of concentra- 
tions, parameters, and the like, without affecting the spirit or scope of the 
invention or any embodiment thereof. 
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SEQUENCE LISTING 



Hi GENERAL INFORMATION: 

Si) APPLICANT: Nevaiair.en, Helena K.M. 

Paloheimo, Maria T. 
Miettinen-Oinonen, Arja S.K. 
Torkkeli, Tuula K. 
Cantreil, Michael 
Piddington, Christopher S. 
Rambosek, John A. 
Turunen, Marja K. 
Fagerstrom, Richard B. 

(ii) TITLE OF INVENTION: Production of Phytase Degrading Enzyrm 
in Trichoderma 

(iii) NUMBER OF SEQUENCES : 66 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: 

( B ) STREET : 

(C) CITY: 

(D) STATE: 

( E ) COUNTRY : 

(F) ZIP : 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

tC) OPERATING SYSTEM: PC-DOS/MS-DOS 

<D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/923,724 

(B) FILING DATE: 31-JUL-1992 

(viii) ATTORNEY /AGENT INFORMATION: 
(AJ NAME: 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 

(ix> TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 

(B) TELEFAX : 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2071 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



iixi FEATURE: 

i A) NAME /KEY: CDS 

•3) LOCATION: j c in ( 1 3 6 . . 9 16 , 971. .1088, I14I..1245, 1305. .1727'- 
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Xl ; SEQUENCE DESCRIPTION: SEQ ID NO : I : 

TGGA CCGCAATC7C CGATCGCCGG 3TATAAAAGG TCC7CCAAAC CCC7C7CGG7 

CGA7ATGTAC CCCGCTC3TC ATC7CCAATC CTCTCGAGAG CACC77C7CC AGC7777GTC 

AA77G7ACC7 TCGCA A73 CC7 CGC ACC TCT C7C C7C ACC CTG GCC 737 GCT 
r/ e - p ro Arg Thr Ser Leu Leu 7r.r ueu Aia Cys 

— 3 GCC ACG GGC GCA TCC GCT 77C TCC TAC GGC GC7 GCC A7T OCT CAG 

- * u Ala Thr Gly Ala Ser Aia Phe Ser Tyr Gly Ala Ala Pre Gin 

15 20 25 



110 



3 AG 3TC A7G GGC GCG GAC AGC C7C ACG CCC ACC TGT 3AC ACC GAC 
----- "al M«t G'V Ala Asd Ser Leu Thr Pro Thr Cys Asp Tnr Asp Asn 
15 ' " "* 210 215 2 2 0 



22S 220 



GCT GCC CGC CTA AAC TCC CAG AAC CCC GGC A- 3 AAC CTC ACC 
50 Val Ala Aia Aia Arg Leu Asn Ser Gin Asn Pre Gly y.er Asn Leu Tr.r 



215 



10 ~CA ACC CAG GAG AAG CAG TTC TCT CAG GAG TTC CGC GAT GGC TAC AGC *67 
q^>- -rhr Gin Glu Lvs Gin Phe Ser Gin Glu Phe Arg Asp Gly Tyr Ser 
30 * 35 40 

A"C CTC AAG CAC TAC GGT GGT AAC GGA CCC TAC TCC GAG CGT GTG TCC 315 
Leu Lvs His Tvr Gly Glv Asn Gly Pro Tyr Ser Glu Arg Val Ser 
15 45 ' * 50 55 50 

-AC GGT ATC GCT CGC GAT CCC CCG ACC AGC TGC GAG G7C GAT CAG GTC 3 63 

Tv- Gly lie Ala Arg Asp Pro Pro Thr Ser Cys Glu Val Asp Gin Val 
65 70 75 

ATC ATG GTC AAG CGT CAC GGA GAG CGC TAC CCG TCC CCT TCA GCC GGC- 411 
~>0 T le Met Val Lys Arg His Gly Glu Arg Tyr Pro Ser Pro Ser Ala Gly 
80 85 90 

AAG GAC ATC GAA GAG GCC CTG GCC AAG GTC TAC AGC ATC AAC ACT ACT 4 59 

Lvs Asp He Glu Glu Ala Leu Ala Lys Val Tyr Ser He Asn Thr Thr 
95 100 105 

~>5 GAA TAC AAG GGC GAC CTG GCC TTC CTG AAC GAC TGG ACC TAC TAC GTC 5 07 

Glu Tvr Lys Gly Asd Leu Ala Phe Leu Asn Asp Trp Thr Tyr Tyr Vai 
--- ' 115 120 



r C - AAT GAG TGC TAC TAC AAC GCC GAG ACC ACC AGC GGC CCC TAC GCC 
S-o Asn Glu Cvs Tyr Tyr Asn Ala Glu Thr Thr Ser Gly Pro Tyr Ala 
30 125 * 130 135 140 

~ G ~ — G CTG GAC GC7 TAC AAC CAT GGC AAC GAT TAC AAG GCT CGC TAC 
Glv Leu Leu Asd Ala Tyr Asn His Gly Asn Asp Tyr Lys Axa Arg Tyr 
* 145 150 15a 

GGC CAC CTC TGG AAC GGT GAG ACG GTC GTG CCC TTC 777 TCT AGT GGC 
35 His Leu TrD Asn Glv Glu Thr Val Val Pro Phe Fne Ser Ser Giy 

160 ' 155 170 

"AC GGA CGT GTC ATC GAG ACG GCC CGC AAG TTC GGT GAG GGT TTC TTT 
Glv Arg Val He Glu Thr Ala Arg Lys Phe Gly Glu Gly Phe Phe 
* 175 180 135 

40 3GC TAC AAC TAC TCC ACC AAC GCT GCC CTC AAC ATC ATC TCC GAG TCC 74 7 

' - "vr Asn Ty- Ser Thr Asn Ala Ala Leu Asn He -_e Ser G±u Se_ 
: 90 195 . 200 



603 



651 



699 



795 



3AC CAG ACC ACC TGC GAC AAC CTG ACT TAC CAG CTG CCC CAG TTC AAG 64 3 

Asp Gin Thr Thr Cys Asp Asn Leu Thr Tyr Gin Leu .-re Gin Phe las 



391 



--5 250 
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Ala Se: 



S ?If j££ S SI? G GTA — ^ T A C G G 7 A C AA TCATTGGCTC 

255 ISC 



AAA: 



:7CCAG CTGACAGCA7 CCTAG 7~ 



GCC ~" — 



, " 'Ort-J ^ - ^ i-Lrti 

V-l Met Ala Ser ?r.e Glu Leu Asn Ai, 

2 c 5 



C7C AA7 GC7 



Arg III ill IH jSS S ? AC C ;F T ACC CAG GAC GAA T ^G =TC 

2 70 Asn Aia ?h - Thr Gin Aso Giu Tro Val 

- = 230 - 23 - 

10 AGC TTC GGT TAC GTT GAG GAT TTG AAC TAC TAC T^C TGC GCT rr r 

Ser rn.e Glv Tvr Val n,, a„ T * Z. lut - GC T GG G 

290 yr Tyr Tyr Cys Ala G1 / 

295 300 
TGAGTTTACC ATTTGATCCA TTATTGTCTT GGATCAGCTA ACGATCGATA G T CCC 
15 Pro 

m m & xs si s si? sj s - - - - s s 

310 315 

20 S « 5 2S J£ as S* S£ Si E S? IS IS 

325 330 

£n J*. 3 ™*"™ GCAGAATCAG AGTCTCACAA AAAGAAACTC TTCACTAACA 

235 

— T S SS £ ?S ^ ?K ?£ £ ffi s S JS S 
S5 i S K S i£ SS SS S S S SIJ £f s 
S, c iE S E JIf S?5 ^ SI? s £? SJ Sg 21 s 

35 s in a; s s ??; a- s s « - - - «. J 

390 395 

£F C I AC CGT CTG G7G CTG AAC GAG GC7 G T A C~C — — r >ao 

Thr .vr Val .Arg Leu Val Leu Asn Glu Ala SiJ ^ J« Pne" JIS ^ 

" ** 05 410 

40 Jvs ?g HI Qiy G ?5 £5 JfE P C GCT r G **= TAC AGC TGC 

4is y ly T> - Cys Pro L = u Ala Asn Tvr Thr Ser 

ATC CTG AAC AAG AAT CTG CCA GAC TAC ACG ACC ACC ~3C AAT G~C — 
— «sn Ly . Asn Leu Pro Asp Tyr Thr Thr Thr Cvs JiJ VaJ 

440 



25 



30 



45 



Ala 
44 5 

ACG 



TAC CCG CAG 7A7 CTG AGC 77C 7GG 



50 



= r Tyr Pro Gin Tyr L-=u Ser 
450 

«3 CTG AAC TAC CGC TCT AGC C: 
Leu Asn Tvr Arq Ser Ser 
4€5 



AAC 7AC AAC ACC ACG 



ie Trp 7rp Asn Tyr Asn Thr Thr 
455 460 

' ^ Z't' Jr" GAG GGT GAT 

" C " ~ - " ia c >'s Gin Glu Glv Asd 
470 



.04- 



1089 



L44 



-1S2 



L24C 



-2SS 



1344 



1440 



1488 



1536 



158^ 



1660 
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i"3 GAG TAGATGCAGA GGGGTAGGTC CCGGGATAC. . .AGTGmon 
Met Asd 



^GTTTGGTGG TGACGATCAC CTTGTTAATA GTCTTGTACA GTCATACGGT _3.s 



ISO 



19 0 



40 Ser Thr Asn Aia Ala Leu Asn He He Ser Glu Ser Glu Val Met Gly 
- r. =; 2 0 C 2 C 5 



1 S9 7 



5 GAATGTAAAT AATGATAATA GCAATGATAC ATGTTGGAAT CTCGTTTTG" 

cataggcgct TTGGGGGTGT ATTTTTAGGC GTTAGACTTA TTTTCAATTC GTGTATAATG 19 5 7 

CGGTCAGTAA AT G AATC AT C AATTATTCAA ATGCAATGCT GTATACGTGA AACTATTGGG 2 317 

T7AAGACGCA GCTACTAGCT GACTGCTTGG TTACTTTCTG TGTACACCGC ATGC 2C71 

INFORMATION FOR SEQ ID NO : 2 : 

10 ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 79 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

•ii) MOLECULE TYPE: protein 

15 <xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

w e r s-o Arq Thr Ser Leu Leu Thr Leu Ala Cys Ala Leu Aia Thr Gly 
* I * 5 10 15 

Ala Ser Ala Phe Ser Tyr Gly Ala Ala He Pro Gin Ser Thr Gin Glu 
20 25 30 

20 Lvs Gin Phe Ser Gin Glu Phe Arg Asp Gly Tyr Ser He Leu Lys His 
35 40 45 

Tvr Glv Glv Asn Gly Pro Tyr Ser Glu Arg Val Ser Tyr Gly He Ala 
5*0 * 55 60 

A -a Asd P-o Pro Thr Ser Cys Glu Val Asd Gin Val He Met Val Lys 
25 €5 ' 70 75 30 

Ara H<s Gly Glu Arg Tyr Pro Ser Pro Ser Ala Gly Lys Asp He Glu 
" 35 90 25 

G'u Ala Leu Ala Lvs Val Tyr Ser He Asn Thr Thr Glu Tyr Lys Gly 
100 * 105 110 

30 Asc Leu Ala Phe Leu Asn Asp Trp Thr Tyr Tyr Val Pro Asn Glu Cys 
115 120 125 

- vr Asn Ala Glu Thr Thr Ser Glv Pro Tyr Ala Gly Leu Leu Asp 
**" 120 135 140 

Tvr Asn His Glv Asn Asp Tvr Lys Ala Arg Tyr Gly His Leu Trp 
35 145 " * 130 155 ISO 

^s- ~lv Glu Thr Val Val Pro Phe Phe Ser Ser Gly Tyr Gly Arg Val 
" 165 170 175 

He Glu Thr Ala Arc Lys Phe Gly Glu Gly Fhe Phe Gly Tyr Asn Tyr 



.-. - a 



sd Ser Leu Thr Pro Thr Cys Asp Thr Asp Asn Asp Glr. Thr Thr 
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Cys Asp Asn Leu Thr Tyr Glr. Leu Pre Gin ?he Lvs Val Ala Ala Ala 
225 230 235 ' 240 

Arg Leu Asn Ser Gin Asn Pro Gly Met Asn Leu Thr Ala Ser Asn Val 
24 = 23C 255 

Tyr A sr. Leu Met Val Met Ala Ser Phe Glu Leu Asn Ala Arg Pre Phe 
260 265 27C 

Ser Asn Trp He Asn Ala Phe Thr Gin Asd Glu TrD Val Ser Phe Glv 
275 28C " * 235 

Tyr Val Glu Asp Leu Asn Tvr Tyr Tvr Cvs Ala Glv Pre Glv Asd Lys 
290 295 * " * 300 

Asn Met Ala Ala Val Gly Ala Val Tvr Ala Asn Ala Ser Leu Thr Leu 
305 210 315 320 

Leu Asn Gin Gly Pro Lys Glu. Ala Gly Ser Leu Phe Phe Asn Phe Ala 
325 330 335 

His Asp Thr Asn He Thr Pro He Leu Ala Ala Leu Gly Val Leu He 
340 345 350 

Pro Asn Glu Asd Leu Pro Leu Asd Arg Val Ala Phe Glv Asn Pro Tvr 

3 55 3 60 3 65 

Ser He Gly Asn He Val Pro Met Glv Glv His Leu Thr He Glu Arg 
370 375 * * 380 

Leu Ser Cys Gin Ala Thr Ala Leu Ser Asp Glu Gly Thr Tyr Val Ara 
335 390 " 395 400 

Leu Val Leu Asn Glu Ala Val Leu Pro Phe Asn Asp Cys Thr Ser Gly 
405 410 415 

Pro Gly Tyr Ser Cys Pro Leu Ala Asn Tyr Thr Ser He Leu Asn Lys 
420 425 430 

Asn Leu Pro Asp Tyr Thr Thr Thr Cys Asn Val Ser Ala Ser Tyr Pro 

4 3 5 44 0 44 5 

Gin Tyr Leu Ser Phe Trp TrD Asn Tyr Asn Thr Thr Thr Glu Leu Asn 
450 455 460 

Tyr Arg Ser Ser Pre He Ala Cys Gin Glu Gly Asp Ala Met Asd 
455 470 475 

{2i INFORMATION FOR SEQ ID NO : 3 : 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



.XI ) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
^YTAYGGNC AYGGNGC 

I : INFORMATION FCR SEQ ID NO : 4 : 

i ) SEQUENCE CHARACTERISTICS: 
{ A ) LENGTH : 1 7 base pairs 
( E ) TYPE: nucleic acid 
:C1 STRANDEDNESS: sinale 
V D) TCFOLOGY: cc:r. 
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ix: ) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 
CARGGNGTNG GNTAYGC 

12) INFORMATION FOR SEQ ID NO : £ : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TAYGTNGARA TGATGCARAA 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
ATGATG CAAA ATCAAGCTGA ACA 23 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: join<404 . .447, 550.. 1905) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CATCCAGGCA CCCTTTCCCA ACGGGGGAAC TTCCGTTGTC CACGTGCCCT GGTTCAGCCA 6 0 

ATCAAAGCGT CCCACGGCAA TGCTGGATCA ACGATCAACT TGAATGCAAT AAATGAAGAT 12 0 

GCAACTAACA CCATCTGTTG CCTTTCTCTC GAGAAAGCTC CTCCACTTCT CACACTAGAT 180 

TTATCCGTTC CTTGTCGACT TCCCGTCCCA TTCGGCCTCG TCCACTGAAG ATCTATCCCA 24 0 

CCATTGCACG TGGGCCACCT TTGTGAGCTT CTAACCTGAA CTGGTAGAGT ATCACACAAC 3 00 

ATGCGAAAGT GGGATGAAGG G GTT AT AT G A GGACCGTCCG G7CCSGCGCG ATGGCCGTAG 360 

CTGCCAATCG CTGCTGTGCA AGAAATTTCT TCTCATAGGC ATC ATG GGC GTC TCT 415 

Met Gly Val Ser 

GCT GTT CTA CTT CCT TTG TAT CTC CTA GCT GG C-7ATGCTAAG 4 57 

Ala Val Leu Leu Pro Leu Tvr Leu Leu Ala Gly 
5 10 15 
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wnC^jwiATC TAAGT CTG AT AAGGACCCTC TTTGCCGAGG GCCCCTGAAG CTCGGACTGT 5""" 

- oyjACiAC TGATCGCTGA CAATCTGTGC AG A GTC ACC TCC GGA CTG GCA 55 6 

Val Thr Ser Gly Leu Aia 

GTC CCC GCC TGG AGA AAT CAA TCC ACT TGC GAT ACG GTC GAT CAA GGG £16 

Val Pre Ala Ser Ara Asa Gin Ser Thr Cvs Aso Thr Val Aso Gin Glv 

25 20 25 

TAT CAA TGC TTC TCC GAG ACT TCG CAT CTT TGG GGT CAA TAC GCG CCG 6 64 

Tyr Gin Cys Phe Ser Glu Thr Ser His Leu Trp Gly Gin Tyr Ala Pro 

4 0 45 50 

TTC TTC TCT CTG GCA AAC GAA TCG GCC ATC TCC CCT GAT GTG CCC GCC 712 

Phe Phe Ser Leu Ala Asn Glu Ser Ala lie Ser Pro Asd Val Pro Ala 

55 60 65 

GGT TGC AGA GTC ACT TTC GCT CAG GTC CTC TCC CGT CAT GGA GCG CGG 76 0 

Gly Cys Arg Val Thr Phe Ala Gin Val Leu Ser Arg His Gly Ala Ara 

7 0 7 5 8 0 85 

TAT CCG ACC GAG TCC AAG GGC AAG AAA TAC TCC GCT CTC ATT GAG GAG 3 OS 

Tyr Pre Thr Glu Ser Lys Gly Lys Lys Tyr Ser Ala Leu lie Glu Glu 

90 95 vj.0 0 

ATC CAG CAG AAC GTG ACC ACC TTT GAT GGA AAA TAT GCC TTC CTG AAG 6 56 

lie Gin Gin Asn Val Thr Thr Phe Asp Gly Lys Tyr Ala Phe Leu Lys 

105 110 115 

ACA TAC AAC TAC AGC TTG GGT GCA GAT GAC CTG ACT CCC TTC GGA GAG 9 04 

Thr Tyr Asn Tyr Ser Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu 

120 125 " 130 

CAG GAG CTA GTC AAC TCC GGC ATC AAG TTC TAC CAG CGA TAC GAA TCG 9 52 

Gin Glu Leu Val Asn Ser Gly He Lys Phe Tyr Gin Arg Tyr Glu Ser 

135 140 145 

CTC ACA AGG AAC ATC ATT CCG TTC ATC CGA TCC TCT GGC TCC AGC CGC 10 CO 

Leu Thr Arg Asn He He Pro Phe He Arg Ser Ser Gly Ser Ser Arg 

150 155 ~ 160 165 

GTG ATC GCC TCC GGC GAG AAA TTC ATT GAG GGC TTC CAG AGC ACC AAG 104 8 

Val He Ala Ser Gly Glu Lys Phe He Glu Gly Phe Gin Ser Thr Lys 

170 175 180 

CTG AAG GAT CCT CGT GCC CAG CCG GGC CAA TCG TCG CCC AAG ATC GAC 109 6 

Leu Lys Asp Pro Arg Ala Gin Pro Gly Gin Ser Ser Pro Lvs He Asp 

" " 185 190 195 

GTG GTC ATT TCC GAG GCC AGC TCA TCC AAC AAC ACT CTC GAC CCA GGC 1144 

Val Val He Ser Glu Ala Ser Ser Ser Asn Asn Thr Leu Aso Pro Gly 

200 205 210 

ACC TGC ACT GTC TTT GAA GAC AGC GAA TTG GCC GAT ACC GTC GAA GCC 119 2 

Thr Cvs Thr Val Phe Glu Asd Ser Glu Leu Ala Asd Thr Val Glu Ala 

215 220 225 

AAT TTC ACC GCC ACG TTC GCC CCC TCC ATT CGT CAA CGT CTG GAG AAC 124 0 

Asn Phe Thr Ala Thr Phe Ala Pro Ser He Ara Gin Ara Leu Glu Asn 

220 235 240 " 245 
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CT3 TCC CCC TTC TGT GAC CTG TTC ACC CAT GAC GAA TGG ATC CAC TAC 1364 

leu Ser Pro Phe Cvs Asd Leu Phe Thr His Asd Glu Trn lie His Tyr 
26C " * 2S5 " 29C 

GAC TAC CTC CAG TCC CTG AAA AAA TAC TAC GGC CAT GGC GCA GGT AAC 1432 
Asd Tvr Leu Gin Ser Leu Lvs Lvs Tvr Tyr Gly His Gly Ala Gly Asn 
295 300 " " " 305 - 

CCG CTC GGC CCG ACC CAG GGC GTC GGC TAC GCT AAC GAG CTC ATC GCC 148 0 

Pre Leu Glv Pro Thr Gin Glv Val Glv Tvr Ala Asn Glu Leu lie Ala 
210 " 315 " " * 320 325 

CGT CTC ACC CAC TCG CCT GTC CAC GAT GAC ACC AGC TCC AAC CAC ACC 152 8 

Arg Leu Thr His Ser Pro Val His Asd Asd Thr Ser Ser Asn His Thr 
330 335 340 

TTG GAC TCG AAC CCA GCT ACC TTC CCG CTC AAC TCT ACT CTC TAC GCG 1576 
Leu Asd Ser Asn Pro Ala Thr .Phe Pro Leu Asn Ser Thr Leu Tyr Ala 
345 350 355 

GAC TTT TCC CAC GAT AAC GGC ATC ATC TCT ATC CTC TTT GCT TTG GGT 1624 
Asd Phe Ser His Asp Asn Gly lie lie Ser lie Leu Phe Ala Leu Gly 
360 365 370 

CTG TAC AAC GGC ACT AAG CCG CTG TCT ACC ACG ACC GTG GAG AAT ATC ^ 16 72 
Leu Tvr Asn G.v Thr Lys Pro Leu Ser Thr Thr Thr Val Glu Asn lie 
375 * 380 385 

ACC CAG ACA GAT GGG TTC TCG TCT GCT TGG ACG GTT CCG TTT GCT TCG 172 0 

Thr Gin Thr Asp Gly Phe Ser Ser Ala Trp Thr Val Pro Phe Ala Ser 
390 " 395 400 405 

CGT CTG TAC GTC GAG ATG ATG CAG TGC CAG GCC GAG CAG GAG CCG CTG 1768 
Arg Leu Tyr Val Glu Met Met Gin Cys Gin Ala Glu Gin Glu Pro Leu 
410 415 420 

GTC CGT GTC TTG GTT AAT GAT CGC GTT GTC CCG CTG CAT GGG TGT CCA 1816 
Val Arg Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys Pro 
425 * 430 435 



AT GCT TTG GGG AGA TGT ACC CGG GAT AGC TTT GTG AGG GGG TTG 18 64 

Asd Ala Leu Gly Arg Cys Thr Arg Asp Ser Phe Val Arg Gly Leu 
440 445 450 



AGC TTT GCT 
Ser Phe Ala 
455 


AGA TCT GGG GGT GAT TGG GCG GAG 
Arg Ser Gly Glv Asp Trp Ala Glu 
46 0 


TGT TCT GCT 
Cvs Ser Ala 
465 


1906 


TAGCTGAACT 


ACCTTGATGG 


ATGGTATGTA 


TCAATCAGAG 


TACATATCAT 


TACTTCATGT 


1966 


ATGTATTTAC 


GAAGATGTAC 


ATAT CGAAAT 


ATCGATGATG 


ACTACTCCGG 


TAGATATTTG 


2026 


GTCCCCTTCT 


ATCCTTCGTT 


CCACAACCAT 


CGCACTCGAC 


GTACAGCATA 


ATACAACTTC 


2086 


AGCATTAACA 


AACGAACAAA 


TAATATTATA 


CACTCCTCCC 


CAATGCAATA 


ACAACCGCAA 


2146 


TTCATACCTC 


ATATAGATAC 


AATACAATAC 


ATCCATCCCT 


ACCCTCAAGT 


CCACCCATCC 


2206 


t AT AAT C AAA 


TCCCTACTTA 


CTCCTCCCCC 


TTCCCAGAAC 


CCACCCCCGA 


AGGAGTAATA 


2266 


GTAGTAGTAG 


AAG AAG CAG A 


CGACCTCTCC 


ACCAACCTCT 


TCGGCCTCTT 


ATCCCCATAC 


2326 


"TATACACA 


CACGAACACA 


CCAAATAGTC 


AGCATG C 






2363 
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INFORMATION FOR SEQ ID NO : 8 



<L) SEQUENCE CHARACTERISTICS: 

i A ) LENGTH: 46 7 amino acids 
» 5 ) TYPE: amino acid 
3 (D) TOPOLOGY: linear 

iii; MOLECULE TYPE: protein 

ixij SEQUENCE DESCRIPTION: SEQ 10 NO : 6 : 

Met Gly Val Ser Ala Val Leu Leu Pro Leu Tvr Leu Leu Ala Glv Val 
■ 1 5 IC 15 

10 Thr Ser Gly Leu Ala Val Pro Ala Ser Arg Asn Gin Ser Thr Cvs Asd 
20 25 v 30 

Thr Val Asp Gin Gly Tyr Gin Cys Phe Ser Glu Thr Ser His Leu Tro 
35 " * 40 45 

Gly Gin Tyr Ala Pro Phe Phe Ser Leu Ala Asn Glu Ser Ala lie Ser 
15 50 55 60 

Pro Asd Val Pro Ala Gly Cys Arg Val Thr Phe Ala Gin Val Leu Ser 
c5 70 75 80 

Arg His Gly Ala Arg Tyr Pro Thr Glu Ser Lys Gly Lys Lys Yvr Ser 
85 90 * 95 

20 Ala Leu lie Glu Glu lie Gin Gin Asn Val Thr Thr Phe Asd Gly Lys 
100 105 lib 

Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr Ser Leu Gly Ala Asd Asp Leu 
115 120 125 



Thr Pro Phe Gly Glu Gin Glu Leu Val Asn Ser Gly lie Lvs Phe Tyr 
2:> 130 135 140 

Gin Arg Tyr Glu Ser Leu Thr Arg Asn lie lie Pro Phe lie Arg Ser 
145 150 155 160 

Ser Gly Ser Ser Arg Val lie Ala Ser Gly Glu Lys Phe lie Glu Gly 
165 170 175 

30 Phe Gin Ser Thr Lys Leu Lys Asd Pro Arg Ala Gin Pro Glv Gin Ser 
180 " " 185 190 

Ser Pro Lys lie Asd Val Val lie Ser Glu Ala Ser Ser Ser Asn Asn 
195 ■ * 200 205 

Thr Leu Asd Pro Glv Thr Cvs Thr Val Phe Glu Asd Ser Glu Leu Ala 
3D 210 ' * 215 220 

Asp Thr Val Glu Ala Asn Phe Thr Ala Thr Phe Ala Pro Ser lie Arg 
225 230 235 240 

GIr. Arg Leu Glu Asn Asp Leu Ser Gly Val Thr Leu Thr Asd Thr Glu 
245 " ' 250 " 255 

40 Val Thr Tyr Leu Met Asp Met Cys Ser Phe Asp Thr He Ser Thr Ser 
260 255 " 27C 

Thr Val Asd Thr Lvs Leu Ser Pro Phe Cvs Asd Leu Phe Thr His Asd 
275 * 2S0 * * 2S5 

3lu Trp He His Tyr Asn Tyr Leu Gin Ser Leu Lvs Lvs Tvr Tyr Glv 
45 290 " ~ 295 2 o C 
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His Gly Ala Gly Asn Pro Leu Gly Pro Thr Gin Gly Val Gly Tyr Ala 
305* " 3IC " 315* 32 0 

Asn Glu Leu He Ala Arg Leu Thr His Ser Pro Val His Asp Asp Thr 
325 33C *33 5 

Ser Ser Asn His Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn 
340 ~ 345 - 350 

Ser Thr Leu Tyr Ala Asd Phe Ser His Asp Asn Gly lie He Ser He 
355 * ' 360 365 

Leu Phe Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr 
370 375 380 

Thr Val Glu Asn He Thr Gin Thr Asp Gly Phe Ser Ser Ala Trp Thr 
385 390 395 400 

Val Pro Phe Ala Ser Arg Leu Tyr Val Glu Met Met Gin Cys Gin Ala 
405 * 410 415 

Glu Gin Glu Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Val Pro 
420 ~* 425 " ~ 430 

Leu His Gly Cys Pro He Asp Ala Leu Gly Arg Cys Thr Arg Asp Ser 
435 * " 440 445 

Phe Val Arg Gly Leu Ser Phe Ala Arg Ser Gly Gly Asp Trp Ala Glu 
450 * 455 460 

Cys Ser Ala 
465 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : S : 

CAACCGCGGA CTGCGCATCA TGGGCGTCTC TGCTGTTCT 

(2) INFORMATION FOR SEQ ID NO: 10: 

vii SEQUENCE -CHARACTERISTICS : 
(A) LENGTH: 22 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 

{xi) SEQUENCE DESCRIPTION: SEQ* ID NO:10: 

ATTTCTCGAG GCGGGGACTG CC 

12) INFORMATION FOR SEQ ID NO: 11: 

■ L) SEQUENCE CHARACTERISTICS: 
■A) LENGTH: 46 base pairs 
(3) TYPE: nucleic acid 
'C> STRANDEDNESS: single 
: D ) TOPOLOGY: both 
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xi SEQUENCE DESCRIPTION: SEC ID NO : 1 1 : 

CTCGGCC7TC TTGGCGAGAG CTCGTGCTCT GGCAGTCCCC 

-2': INFORMATION FOR SEQ ID NO: 12: 

- '' SEQUENCE CHARACTERISTICS: 
iA) LENGTH: 24 base oairs 
(3) TYPE: nucleic acid 
:C) STRANDEDNESS : sinaie 
( D ) TOPOLOGY : hot h 



:xi> SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
TTGGTGTCGA CGGTGCTGGT GGAG 
(2) INFORMATION FOR SEQ ID NO : 13 : 

'i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(Xij SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CTCGGCCTTC TTGGCCACAG CTCGTGCTTT CTCCTACGGC GCTGCC 
(2) INFORMATION FOR SEQ ID NO:14: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinaie 

(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCATGGTTG TACGCGTCCA GCAAACCGGC 
2) INFORMATION FOR SEQ ID NO: 15: 

■i: SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



■xi/ SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

AACCGCGGA CTGCGCATCA TGCCTCGCAC CTCTCTCCT 

Z INFORMATION FOR SEQ ID NO: 16: 

i. SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
(2) TYPE: nucleic acid 
(C) STRANDEDNESS: sincle 
f D) TOPOLOGY: bcth 



WO 94/03612 



PCI7FI93/00310 



95 - 



1X1) SEQUENCE DESCRIPTION: SEQ ZD NO : It 



INFORMATION FOR SEQ ID NO: 17: 

. i: SEQUENCE CHARACTERISTICS: 

<n) LENGTH: 14 amino acids- 
■5) TYPE: amino acid 
' D ) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Leu Ala Val Pro Ala Ser Arg Asn Gin Ser Ser Glv Asd Thr 
1 5 10 

(2) INFORMATION FOE SEQ ID NO: 18: 

•i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Arg His Gly Xaa Arg Xaa Pro 

1 . 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(D) TOPOLOGY: both 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

Lvs Asd Pro Arg Ala 
1 " " 5 

:2) INFORMATION FOR SEQ ID NO : 2 0 ; 

•i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(E) TYPE: amino acid 
(D) TOPOLOGY: both 



■xi; SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 

Tvr Tvr Gly His Leu Glv Ala Glv Asn Pro Leu Gly Pro Thr Gin 
1* 5 1C 15 



INFORMATION FOR SEQ ID NO: 21: 

i) SEQUENCE CHARACTERISTICS: 

■; A ) LENGTH: 11 amino acids 
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SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
Gly Tyr Val Gin Asn Tyr Vai G 1 r. Me: Gi 



INFORMATION FOR SEQ ID NO:22: 

•i? SEQUENCE CHARACTERISTICS: 
\A; LENGTH: 9 amino acids 
:3) TYPE: amino acid 
;D) TOPOLOGY: both 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 6 . .7 

(D) OTHER INFORMATION: /label= Peptide 

/note= "When deduced from the DNA seauence the 
amino acids at positions 5 and 7 were found to b~ 
serine." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Ala Gin Pro Glv Gin Ala Ala Pro Lvs 
1 5 

(2) INFORMATION FOR SEQ ID NO: 23: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Leu Tyr Val Glu Met Met Gin Asn Gin Ala Glu Gin Thr Pro Leu Val 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 24: 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: both 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Leu Tyr Val Glu Met Met Gin Cys Gin Ala Glu Gin Glu Pro Leu Val 



2; INFORMATION FOR SEQ ID NO: 25: 

»i> SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
IE) TYPE: amino acid 
(D) TOPOLOGY: both 



?he 



WO 94/03612 



PCT/FI93/00310 



- 97 - 



(2) INFORMATION FOR SEQ ID NO : 2 6 : 

■i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
{ 5 > TYPE: amino acid 
iD) TOPOLOGY: both 



.xij SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
?he lie Glu Gly Phe Gin Ser Asp Lys 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : both 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO;27: 

Tvr Ala Phe Leu Lys 
5 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Gly Leu Ser Phe Ala Arg 
I 5 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Val lie Ala Ser Gly Glu Lys 
1 5 

{2) INFORMATION FOR SEQ ID NO : 3 0 : . 

: i i SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4 amino acids 
= ) TYPE: amino acid 
*D) TOPOLOGY: both 



SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 
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INFORMATION FOR SEQ ID NO : 3 1 : 

i: SEQUENCE CHARACTERISTICS: 
'.Aj LENGiK: 9 amino acics 
iS) TYPE: amino acid 
ID) TOPOLOGY: both 



xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 
?he Tyr Gin Arg Asp Ser Phe Vai Arg 



\2) INFORMATION FOR SEQ ID NO;32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
<B) TYPE; amino acid 
(D) TOPOLOGY: both 



'.xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
Asp Ser Phe Val Arg 

(2) INFORMATION FOR SEQ ID NO: 33: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: both 



30 



;xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 3 



il Leu Val Asn Asd 
5 



25 ;2; INFORMATION FOR SEQ ID NO: 34: 



'i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



*-.xii SEQUENCE DESCRIPTION: SEQ ID NO:34; 

Tyr Glu Ser Leu Gin 
^ 5 

I INFORMATION FOR SEQ ID NO : 3 5 : 

i : SEQUENCE CHARACTERISTICS: 
''A) LENGTH: 6 amino acids 
\B) TYPE: amino acid 
TOPOLOGY: both 



xi; SEQUENCE DESCRIPTION: SEQ ID NO : 3 = 
4 q Tyr Glu Ser Leu Thr Arg 
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INFORMATION FOR SEQ ID NO : 3 6 : 

. 1 < SEQUENCE CHARACTERISTICS: 
iA) LENGTH: 1 amino acids 
{ 3 ) TYPE: amino acid 
■D) TOPOLOGY: both 



•.Xl ) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 : 

Ser Ala Ala Ser Leu As.n Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 37: 

10 Ki) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 7 : 

15 Leu Lys Asp Pro Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO : 3 8 : 

ti) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
20 (B) TYPE: amino acid 

<D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 

Val lie Ala Ser Gly Glu Lys 
1 5 

25 (2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 amino acids 
{ B ) TYPE: amino acid 
(D) TOPOLOGY: both 



30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Tvr Pro Thr Glu Ser Lys 
I* 5 

( 2) INFORMATION FOR SEQ ID NO: 40: 

=.i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 5 amine acids 

(3) TYPE: amino acid 
•D) TOPOLOGY: both 



; X 1 ) SEQUENCE DESCRIPTION: SEQ ID NO : 4 0 : 
Tvr Phe Asn Xaa Giy 

40 5 
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INFORMATION "0?. SEQ ID NO : 4 1 : 

•ii SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 10 amino acids 
i'3) TYPE: amino acid 
(D) TOPOLOGY: both 



< ix) FEATURE: 

(A) NAME /KEY: Peotide 

(B) LOCATION: 3." 8 

■D) OTHER INFORMATION: /Iabei= Peptide 

/note= "The following are alternative amino acids 
at these positions: Proline at 2, Phenylalanine 
at 4, Serine at 6, Leucine at 7, and Valine at 8." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 
Leu Glu Asn Asp Leu Asp Gly Phe Thr Leu 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Leu Glu Asn Asd Leu Ser Gly Val Thr Leu Thr 
- 5 10 

(2) INFORMATION FOR SEQ ID NO: 43: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /labels Peptide 

/note= "The amino acid at oositicn 17 mav also be 
Tyrosine 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

Tyr Tyr Gly His Gly Ala Gly Asn Pro Leu Glv Pro Thr Gin Glv Val 
1 5 10 15' 

Gly Ala Asn Glu 

INFORMATION FOR SEQ ID NO:44: 

.i; SEQUENCE CHARACTERISTICS: 
; A ) LENGTH: 3 amino acids 
' B ) TYPE: amino acid 
:D) TOPOLOGY: both 
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'.xi/ SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
Iru lie Ala 



INFORMATION FOR SEQ ID NO : 4 5 : 

SEQUENCE CHARACTERISTICS: 
i A) LENGTH: 8 amino acids 
■3) TYPE: amino acid 
(D ) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Val Thr Phe Ala Gin Va.1 Leu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

( B ) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Phe lie Glu Gly Phe Gin Ser Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(2) TYPE: amino acid 
(D) TOPOLOGY: both 



iix) FEATURE: 

(A) NAME /KEY : Peptide 
(E) LOCATION: 1 

{D) OTHER INFORMATION: /label- Peptide 

/note= "The amino acid at position 1 may also be 
Asparagine . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

Aso Tvr Leu Gin Ser Leu Lys . 
1 5 



[2) INFORMATION FOR SEQ ID NO : 4 8 : 

.i- SEQUENCE CHARACTERISTICS: 
A) LENGTH: 8 amino acids 
; 3 ) TYPE: amino acid 
:Z) TOPOLOGY: both 
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■.xi) SEQUENCE DESCRI PTION : SEQ ID NO : 4 6 : 
Asn He Glu Pro ?he Gin Vai A sr. 



INFORMATION FOR SEQ ID NO : 4 9 : 

'i> SEQUENCE CHARACTERISTICS: 
iA) LENGTH: 6 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(Xi; SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
Val Leu Val Asn Asp Arg 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

Leu Ala Val Pro Ala Ser Arg Asd Gin Ser Thr Xaa Asd Thr 
1 5 io 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



(ix> FEATURE: 

(A) NAME / KEY : Peotide 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /labels Peptide 

/note= "When deduced from the DNA sequence the 
amino acid at position 1 was found to be 
cysteine . " 



t'xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
Arg Ser Ala 



INFORMATION FOR SEQ ID NO: 52: 

\i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base Dairs 
(E) TYPE: nucleic acid 
iC) STRANDEDNESS: sinqle 
f D) TOPOLOGY: both 
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I N FO RKAT ION FOP. SEQ ID NO: 53: 

. i ; SEQUENCE CHARACTERISTICS: 

(Ai LENGTH: 14 amino acids 
{Hi TYPE: amino acid 
;D) TOPOLOGY: both 



'.xij SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

-he Ser Tvr Gly Ala Ala lie Pro Gin Ser Thr Gin Glu Lys 
1 * 5 1C 

(2) INFORMATION FOR SEQ ID NO: 54: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gin Phe Ser Gin Glu Phe Arg Asp Gly Tyr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 55: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

( B ) TYPE : amino acid 
<D) TOPOLOGY: both 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Tvr Glv Glv Asn Glv Pro Tyr 
1* 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: both 



ixi* SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Val Ser Tyr Gly lie Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 57:. 

i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
; B ) TYPE: amino acid 
ID) TOPOLOGY: both 



SEQUENCE DESCRIPTION: SEQ ID NO: 57 
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INFORMATION FOR SEQ ID NO: 53: 

<i> SEQUENCE CHARACTERISTICS- 
t^5 TH: 5 a --"ino acids 
• = ) ^P-: amino acid 
ID) TO?OLCGY: bo Eh 



'Xi; SEQUENCE DESCRIPTION: SEQ ID NO:5S: 

Asp He Giu Glu Ala Leu Ala Lvs 

5 

■■2) INFORMATION FOR SEQ ID NO.-59: 
(i) SEQUENCE CHARACTERISTICS - 

IB) TYPE : ammo acid 
ID) TOPOLOGY: both 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:59 : 

Ala Arg Tyr Gly His Leu Trp Asn Gly Giu Th: 

10 

2) INFORMATION FOR SEQ ID NO : 6 0 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: both 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
Val Val Pro Phe Phe Ser Ser Gly 

i' INFORMATION FOR SEQ ID NO: 61: 

H) SEQUENCE CHARACTERISTICS • 
(A) LENGTH: 7 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: both 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 1 : 

Phe Ser Ser Gly Tvr Glv Arg 
1 5 

INFORMATION FOR SEQ ID NO: 62: 

SEQUENCE CHARACTERISTICS: 
LENGTH: 5 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: both 
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(2) INFORMATION FOR SEQ ID NO: 63: 

."i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: ooth 



'.xii SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
Val Ala Phe Gly Asn Pro Tyr 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
( E ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: both 



iix) FEATURE : 

(A) NAME /KEY : modi f ied_base 
(3) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GTRCCNCTYK CNATRGG 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: both 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CARCTNCCNC ARTTYAA 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
(E) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
.(D) TOPOLOGY: both 



;xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
TTCCGAG TCCGAGGTCA TGGGCGCG 
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24. The host cell of claim 23, wherein said genetic sequence encoding said 
phytase has the DNA sequence of SEQ ID No. :7:. or encodes an 
enzymatically active derivative thereof. 

25. The host cell of claim 22, wherein said pH 2.5 acid phosphatase has 
5 the amino acid sequence of SEQ ID No. :2: , or an enzymatically active 

derivative thereof. 

26. The host cell of claim 25, wherein said genetic sequence encoding said 
pH 2.5 acid phosphatase has the DNA sequence of SEQ ID No. :1:. 
or encodes an enzymatically active derivative thereof. 

10 27. The host cell of claim 19, wherein said the sequences encoding said 
phytate degrading enzyme are operably linked to sequences encoding 
a signal sequence. 

28. The host cell of claim 27, wherein said signal sequence is selected 
from the group consisting of the signal sequence of CHBI, CBHII, 

15 EGL EGII, pH 2.5 acid phosphatase and phytase. 

29. The host cell of claim 19, wherein said genetic sequence encoding said 
phytate degrading enzyme is provided by a vector selected from the 
group consisting of pALK171, pALK172, pALK173A and pALK173B. 
pALK532, pALK533 and a fragment thereof that encodes said phytate 

20 degrading enzyme. 

30. The host cell of claim 19, wherein said host cell is Trichoderma reesei. 

31. A method for overexpressing phytate degrading enzymes in 
Trichoderma. wherein the method comprises the steps of: 

(a) preparing a recombinant construct, said recombinant construct 
25 containing a gene encoding a phytate degrading enzyme; 
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(b) transforming a Trichoderma host with said construct: 

(c) cultivating the transformed Trichoderma under conditions 
suitable for said Trichoderma and for expression of said phytate 
degrading enzyme. 

5 32. The method of claim 31. wherein said DNA construct is provided to 
said Trichoderma host in a linear form. 

33. The method of claim 31, wherein said DNA construct is provided to 
said Trichoderma host in a circuiar plasmid form. 



10 



34. The method of claim 32, wherein said linear form lacks bacterial 
sequences. 

35. The method of claim 31, wherein said phytate degrading enzyme is 
selected from the group consisting of phytase, pH 2.5 acid 
phosphatase, an enzymatically active derivative of said phytase and an 
enzymatically active derivative of said pH 2.5 acid phosphatase. 



15 36. 



The method of claim 34, wherein said phytase has the amino acid 
sequence of SEQ ID No. :8:, or an enzymatically active derivative 
thereof. 



20 



37. 



The method of claim 35. wherein the genetic sequence encoding said 
phytase has the DNA sequence of SEQ ID No. :7:, or encodes an 
enzymatically active derivative thereof. 
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The method of claim 34. wherein said pH 2.5 acid phosphatase has the 
amino acid sequence of SEQ ID No. :2:, or an enzymatically active 
derivative thereof. 

The method of claim 37. wherein said genetic sequence encoding said 
pH 2.5 acid phosphatase has the DNA sequence of SEQ ID No. 
or encodes an enzymatically active derivative thereof. 

The method of claim 31, wherein said the genetic sequences encoding 
said phytate degrading enzyme are operably linked to genetic sequences 
encoding a signal sequence. 

The method of claim 39, wherein said signal sequence is^selected from 
the group consisting of the signal sequence of CHBI, CBHII, EGI, 
EGII, pH 2.5 acid phosphatase and phytase. 

The method of claim 31, wherein said recombinant construct is 
selected from the group consisting of pALK171, pALK172, 
pALK173A, pALK173B, pALK532, pALK533 and a fragment thereof 
that encodes said phytate degrading enzyme. 

The method of claim 14. wherein said host cell is Trichoderma reesei. 

A method of cloning a fusion of the Trichoderma reesei cbh\ promoter 
to the Aspergillus niger var. awamori phytase signal sequence, 
wherein said method comprises polymerase chain amplification of a 
fusion of the cbh\ promoter with phytase signal sequence using a 5' 
primer having SEQ ID No. :9: and a 3' primer having SEQ ID No. 
:10:. 

A method of cloning a fusion of DNA encoding the Trichoderma reesei 
CBH1 signal sequence to the Aspergillus niger var. awamori mature 
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phytase coding sequence, wherein said method comprises polymerase 
chain amplification of the DNA encoding said CBH1 signal sequence 
with a 5' primer having SEQ ID No. :11: and polymerase chain 
amplification of the phytase coding sequence with a 3' primer havins 
5 SEQ ID No. :12:. 

46. A method of cloning a fusion of the Trichoderma reesei cbh 1 promoter 
to the Aspergillus niger var. awamori pH 2.5 acid phosphatase signal 
sequence, wherein said method comprises polymerase chain 
amplification of the a sequence encoding a fusion of said cbh I 

0 promoter and said signal sequence with a 5' primer having SEQ ID 

No. :15: and a 3' primer having SEQ ID No. :14:. 

47. A method of cloning a fusion of DNA encoding the Trichoderma reesei 
CBH1 signal sequence to the Aspergillus niger war. awamori coding 
sequence of the mature pH 2.5 acid phosphatase, wherein said method 

> comprises polymerase chain amplification of the DNA encoding said 

CBH1 signal sequence with a 5' primer having SEQ ID No. :13: and 
polymerase chain amplification of said pH 2.5 acid phosphatase coding 
sequence with a 3' primer having SEQ ID No. :14:. 
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PEPTIDE #816: Arg His Gly Glu Arg Tyr Pro Ser Pro Ser Ala 
OUGO PHY-31 3' -GTG CCG CTC GCI ATG GG- 5' 

A A T T A 

T 
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PEPTIDE #1110: Gin Leu Pro Gin Phe Lys 
OUGO PHY-34 5' - CAA CTG CCG CAA TTT AA -3' 
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OUGO PHY-35 5' - CAA TTA CCG CAA TTT AA -3' 

G G A G C 
T 
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GCATGCTGGA CCGCAATCTC CGATCGCCGG GTATAAAAGG TCCTCCAAAC CCCTCTCGGT 60 
CGATATGTAC CCCGCTCGTC ATCTCCAATC CTCTCGAGAG CACCTTCTCC AGCTTTTGTC 120 



AATTGTACCT TCGCA ATG CCT CGC ACC TC7 CTC CTC ACC CTG GCC TGT GCT 171 
Met Pro Arg Thr Ser Leu Leu Thr Leu Ala Cys Ala 
1 5 10 
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GGC CAC CTC TGG AAC GGT GAG ACG GTC GTG CCC TTC TTT TCT AGT GGC 651 

Gly His Leu Trp Asn Gly Glu Thr Val Vol Pro Phe Phe Ser Ser Gly 
160 165 170 

TAC GGA CGT GTC ATC GAG ACG GCC CGC AAG TTC GGT GAG GGT TTC TTT 699 
Tyr Gly Arg Vat He Glu Thr Ala Arg Lys Phe Gly Glu Gly Phe Phe 
175 180 185 

GGC TAC AAC TAC TCC ACC AAC GCT GCC CTC AAC ATC ATC TCC GAG TCC 747 
Gly Tyr Asn Tyr Ser Thr Asn Ala Ala Leu Asn He He Ser Glu Ser 
190 195 200 

GAG GTC ATG GGC GCG GAC AGC CTC ACG CCC ACC TGT GAD ACC GAC AAC 795 
Glu Val He-t Gly Ala Asp Ser Leu Thr Pro Thr Cys Asp Thr Asp Asn 
205 210 215 220 

GAC CAG ACC ACC TGC GAC AAC CTG ACT TAC CAG CTG CCC CAG TTC AAG 843 
Asp Gin Thr Thr Cys Asp Asn Leu Thr Tyr Gin Leu Pro Gin Phe Lys 
225 230 235 

GTC GCT GCT GCC CGC CTA AAC TCC CAG AAC CCC GGC ATG AAC CTC ACC 891 
Val Ala Ala Alo Arg Leu Asn Ser Gin Asn Pro Gly Me-t Asn Leu Thr 
240 245 250 



GCA TCT GAT GTC TAC AAC CTG ATG G GTATGTGAT TACGGTACAA TCATTGGCTC 945 
Ala Ser Asp Val Tyr Asn Leu Met 
255 260 

AAACCTCCAG CTGACAGCAT CCTAG TT ATG GCC TCC TTT GAG CTC AAT GCT 996 

Val Hei Ala Ser Phe Glu Leu Asn Ala 
265 



CGT CCC TTC TCC AAC TGG ATC AAC GCC TTT ACC CAG GAC GAA TGG GTC 1044 
Arg Pro Phe Ser Asn Trp He Asn Ala Phe Thr Gin Asp Glu Trp Val 
270 275 280 285 



AGC TTC GGT TAC GTT GAG GAT TTG AAC TAC TAC TAC TGC GCT GG G 1089 
Ser Phe Gly Tyr Val Glu Asp Leu Asn Tyr Tyr Tyr Cys Ala Gly 
290 295 300 

FIG.2B 
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CATCCAGGCA CCCTTTCCCA ACGGGGG AAC TTCCGTTGTC CACGTGCCCT GGTTCAGCCA 60 

ATCAAAGCGT CCCACGGCAA TGCTGGATCA ACGATCAACT TGAATGCAAT AAATGAAGAT 120 

GCAACTAACA CCATCTGTTG CCTTTCTCTC GAGAAAGCTC CTCCACTTCT CACACTAGAT 180 

TTATCCGTTC CTTGTCGACT TCCCGTCCCA TTCGGCCTCG TCCACTGAAG ATCTATCCCA 240 

CCATTGCACG TGGGCCACCT TTGTGAGCTT CTAACCTGAA CTGGTAGAGT ATCACACAAC 300 

ATGCGAAAGT GGGATGAAGG GGTTATATGA GGACCGTCCG GTCCGGCGCG ATGGCCGTAG 360 

CTGCCAATCG CTGCTGTGCA AGAAATTTCT TCTCATAGGC ATC ATG GGC GTC TCT 415 

Me-t Gly Val Ser 

GCT GTT CTA CTT CCT TTG TAT CTC CTA GCT GG GTATGCTAAG CACCGCTATC 467 
Ala Val Leu Leu Pro Leu Tyr Leu Leu Ala Gl 
r 15 -10 -5 

TAAGTCTGAT AAGGACCCTC TTTGCCGAGG GCCCCTGAAG CTCGGACTGT GTGGGACTAC 527 

TGATCGCTGA CAATCTGTGC AG A GTC ACC TCC GGA CTG GCA GTG CCC GCC TCG 580 

y Val Thr Ser Gly Leu Ala Val Pro Ala Ser 

1 5 

AGA AAT CAA TCC ACT TGC GAT ACG GTC GAT CAA GGG TAT CAA TGC TTC 628 
Arg Asn Gin Ser Thr Cys Asp Thr Val Asp Gin Gly Tyr Gin Cys Phe 
10 15 20 

TCC GAG ACT TCG CAT CTT TGG GGT CAA TAC GCG CCG TTC TTC TCT CTG 676 
Ser Glu Thr Ser His Leu Trp Gly Gin Tyr Ala Pro Phe Phe Ser Leu 
25 30 35 

GCA AAC GAA TCG GCC ATC TCC CCT GAT GTG CCC GCC GGT TGC AGA GTC 724 
Ala Asn Glu Ser Ala lie Ser Pro Asp Val Pro Ala Gly Cys Arg Val 
40 45 50 

ACT TTC GCT CAG GTC CTC TCC CGT CAT GGA GCG CGG TAT CCG ACC GAG 772 
Thr Phe Ala Gin Val Leu Ser Arg His Gly Ala Arg Tyr Pro Thr Glu 
55 60 65 70 

TCC AAG GGC AAG AAA TAC TCC GCT CTC ATT GAG GAG ATC CAG CAG AAC 820 
Ser Lys Gly Lys Lys Tyr Ser Ala Leu lie Glu Glu lie Gin Gin Asn 

75 80 85 

GTG ACC ACC TTT GAT GGA AAA TAT GCC TTC CTG AAG ACA TAC AAC TAC 868 
Val Thr Thr Phe Asp Gly Lys Tyr Ala Phe Leu Lys Thr Tyr Asn Tyr 
90 95 100 

AGC TTG GGT GCA GAT GAC CTG ACT CCC TTC GGA GAG CAG GAG CTA GTC 916 
Ser Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu Gin Glu Leu Val 
105 110 115 

AAC TCC GGC ATC AAG TTC TAC CAG CGA TAC GAA TCG CTC ACA AGG AAC 964 
Asn Ser Gly lie Lvs Phe Tyr Gin Arg Tyr Glu Ser Leu Thr Arg Asn 
120 " 125 130 

ATC ATT CCG TTC ATC CGA TCC TCT GGC TCC AGC CGC GTG ATC GCC TCC 1012 
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lie He Pro Phe lie Arg Ser Ser Gly Ser Ser Arg val lie Ala Ser 
135 140 145 150 

GGC GAG AAA TTC ATT GAG GGC TTC CAG AGC ACC AAG CTG AAG GAT CCT 1060 
Gly Glu Lys Phe He Glu Gly Phe Gin Ser Thr Lys Leu Lys Asp Pro 

155 160 165 

CGT GCC CAG CCG GGC CAA TCG TCG CCC AAG ATC GAC GTG GTC ATT TCC 1108 
Arg Ala Gin Pro Gly Gin Ser Ser Pro Lys lie Asp Val Val He Ser 
170 175 180 

GAG GCC AGC TCA TCC AAC AAC ACT CTC GAC CCA GGC ACC TGC ACT GTC 1156 
Glu Ala Ser Ser Ser Asn Asn Thr Leu Asp Pro Gly Thr Cys Thr Val 
185 - 190 195 

TTT GAA GAC AGC GAA TTG GCC GAT ACC GTC GAA GCC AAT TTC ACC GCC 1204 
Phe Glu Asp Ser Glu Leu Ala Asp Thr Val Glu Ala Asn Phe Thr Ala 
200 205 210 

ACG TTC GCC CCC TCC ATT CGT CAA CGT CTG GAG AAC GAC CTG TCT GGC 1252 
Thr Phe Ala Pro Ser lie Arg Gin Arg Leu Glu Asn Asp Leu Ser Gly 
215 220 225 ^ 230 

GTG ACT CTC ACA GAC ACA GAA GTG ACC TAC CTC ATG GAC ATG TGC TCC 1300 
Val Thr Leu Thr Asp Thr Glu Val Thr Tyr Leu Met Asp Met: Cys Ser 

235 240 245 

TTC GAC ACC ATC TCC ACC AGC ACC GTC GAC ACC AAG CTG TCC CCC TTC 1348 
Phe Asp Thr He Ser Thr Ser Thr Val Asp Thr Lys Leu Ser Pro Phe 
250 255 260 

TGT GAC CTG TTC ACC CAT GAC GAA TGG ATC CAC TAC GAC TAC CTC CAG 1396 
Cys Asp Leu Phe Thr His Asp Glu Trp He His Tyr Asp Tyr Leu Gin 
265 270 275 

TCC CTG AAA AAA TAC TAC GGC CAT GGC GCA GGT AAC CCG CTC GGC CCG 1444 
Ser Deu Lys Lys Tyr Tyr Gly His Gly Ala Gly Asn Pro Leu Gly Pro 
280 285 290 

ACC CAG GGC GTC GGC TAC GCT AAC GAG CTC ATC GCC CGT CTC ACC CAC 1492 
Thr Gin Gly Val Gly Tyr Ala Asn Glu Leu He Ala Arg Leu Thr His 
295 300 305 310 

TCG CCT GTC CAC GAT GAC ACC AGC TCC AAC CAC ACC TTG GAC TCG AAC 1540 
Ser Pro Val His Asp Asp Thr Ser Ser Asn His Thr Leu Asp Ser Asn 

315 320 325 

CCA GCT ACC TTC CCG CTC AAC TCT ACT CTC TAC GCG GAC TTT TCC CAC 1588 
Pro Ala Thr Phe Pro Leu Asn Ser Thr Leu Tyr Ala Asp Phe Ser His 
330 335 340 

GAT AAC GGC ATC ATC TCT ATC CTC TTT GCT TTG GGT CTG TAC AAC GGC 1636 
Asp Asn Gly He He Ser He Leu Phe Ala Leu Gly Leu Tyr Asn Gly 
345 350 355 

ACT AAG CCG CTG TCT ACC ACG ACC GTG GAG AAT ATC ACC CAG ACA GAT 1684 
Thr Lys Pro Leu Ser Thr Thr Thr Val Glu Asn He Thr Gin Thr Asp 
360 365 370 
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GGG TTC TCG TCT GCT TGG ACG GTT CCG TTT GCT TCG CGT CTG TAC GTC 173 2 
Gly Phe Ser Ser Ala Trp Thr Val Pro Phe Ala Ser Arg Leu Tyr Val 
375 380 385 390 



GAG ATG ATG CAG TGC CAG GCC GAG CAG GAG CCG CTG GTC CGT GTC TTG 1780 
Glu Met Met: Gin Cys Gin Ala Glu Gin Glu Pro Leu Val Arg Val Leu 

395 400 405 

GTT AAT. GAT CGC GTT GTC CCG CTG CAT GGG TGT CCA ATT GAT GCT TTG 1828 
Val Asn Asp Arg Val Val Pro Leu His Gly Cys Pro lie Asp Ala Leu 
410 415 420 

GGG AGA TGT ACC CGG GAT AGC TTT GTG AGG GGG TTG AGC TTT GCT AGA 1876 
Gly Arg Cys Thr Arg Asp Ser Phe Val Arg Gly Leu Ser Phe Ala Arg 
425 430 435 

TCT GGG GGT GAT TGG GCG GAG TGT TCT GCT .TAGCTGAACT ACCTTGATGG 1926 
Ser Gly Gly Asp Trp Ala Glu Cys Ser Ala 
440 445 



ATGGTATGTA TCAATCAGAG TACATATCAT TACTTCATGT ATGTATTTAC GAAGATGTAC 1986 



ATATCGAAAT ATCGATGATG ACTACTCCGG TAGATATTTG GTCCCCTTCT ATCCTTCGTT 2046 



CCACAACCAT CGCACTCGAC GTACAGCATA ATACAACTTC AGCATTAACA AACGAACAAA 2106 
TAATATTATA CACTCCTCCC CAATGCAATA ACAACCGCAA TTCATACCTC ATATAGATAC 2166 
AATACAATAC ATCCATCCCT ACCCTCAAGT CCACCCATCC CATAATCAAA TCCCTACTTA 2226 



CTCCTCCCCC TTCCCAGAAC CCACCCCCGA AGGAGTAATA GTAGTAGTAG AAGAAGCAGA 2286 



CGACCTCTCC ACCAACCTCT TCGGCCTCTT ATCCCCATAC GCTATACACA CACGAACACA 2346 



CCAAATAGTC AGCATGC 
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5- C AA£CQC fiGACTG 030 ATC ATG GGC GTC TCT GCT GTT CT 
Sad\ 

19 nts "tail" of cbh\ 20 nts cf the phytase signal sequence 
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XhcA 
phytase sequence 
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TGAGTTTACC ATTTGATCCA TTATTGTCTT GGATCAGCTA ACGATCGATA G T CCC 1144 

Pro 

GGT GAC AAG AAC ATG GCT GCT GTG GGT GCC GTC TAC GCC AAC GCC AGT 1192 
Gly Asp Lys Asn Met Ala Ala Val Gly Ala Val Tyr Ala Asn Ala Ser 
305 310 315 

CTC ACC CTC CTG AAC CAG GGA CCC AAG GAA GCC GGC TCC TTG TTC TTC 1240 
Leu Thr Leu Leu Asn Gin Gly Pro Lys Glu Ala Gly Ser Leu Phe Phe 
320 325 330 



AAC TT GTACGTTCTCG GCAGAATCAG AGTCTCACAA AAAGAAACTC TTCACTAACA 1296 
Asn Phe 
335 

TATAGTAG T GCC CAC GAC ACC AAC ATC ACC CCC ATC CTC GCC GClTcTA 1344 
Ala His Asp Thr Asn lie Thr Pro He Leu Ala Ala Leu 
340 345 
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GCG TCC TAC CCG CAG TAT CTG AGC TTC TGG TGG AAC TAC AAC ACC ACG 1680 
Ala Ser Tyr Pro Gin Tyr Leu Ser Phe Trp Trp Asn Tyr Asn Thr Thr 
445 450 455 460 

ACG GAG CTG AAC TAC CGC TCT AGC CCT An GCC TGC CAG GAG GGT GAT 1728 
Thr Glu Leu Asn Tyr Arg Ser Ser Pro He Ala Cys Gin Glu Gly Asp 
465 470 475 

GCT ATG GAC TAGATGCAGA GGGGTAGGTC CCGGGATACT TTAGTGATGA 1777 
Ala Het Asp ^ 



TTGATATTCA AGTTTGGTGG TGACGATCAC CTTGTTAATA GTCTTGTACA GTCATACGGT 1837 

GAATGTAAAT AATGATAATA GCAATGATAC ATGTTGGAAT CTCGTTTTGT TCTTTGTGTG 1897 

CATAGGCGCT TTGGGGGTGT ATTTTTAGGC GTTAGACTTA nnCAAnC GTGTATAATG 1957 

CGGTCAGTAA ATGAATCATC AATTATTCAA ATGCAATGCT GTATACGTGA AACTATTGGG 2017 

TTAAGACGCA GCTACTAGCT GACTGCTTGG TTACTTTCTG TGTACACCGC ATGC 2071 

>Sphl 
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PEPTIDE f 792 
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OLIGONUCLEOTIDE 1 
TAT TAT GGT CAT GGT GC 
C C C C C 

G- G 

A A 



13 14 15 16 17 18 19 20 21 22 23 24 
- Thr Gin Gly Val Gly Tyr Ala Asn Glu Leu He Ala - 

OLIGONUCLEOTIDE 2 
CAA GGT GTT GGT TAT GC 
' G C C C C 

G G G 

AAA 

PEPTIDE # 420 

1234567 8 9 10 11 
Leu Tyr Val Glu Met Met Gin (Asn) Gin Ala (Glu) 

12 13 14 15 16 
Gin (Thr) Pro Leu Val 

OLIGONUCLEOTIDE 3 
TAT GTT GAA ATG ATG CAA AA 
C C G G 

G 

A 

OLIGONUCLEOTIDE 4 

ATG ATG CAA AAT CAA GCT GAA CA 



FIG.3 

« .RSXITI m= SHEET 




SUBSTITUTE SHEET 



WO 94/03612 



PCT/F193/00310 



8/22 

CATCCAGGCA CCCTTTCCCA ACGGGGGAAC TTCCGTTGTC CACGTGCCCT GGTTCAGCCA 60 

ATCAAAGCGT CCCACGGCAA TGCTGGATCA ACGATCAACT TGAATGCAAT AAATGAAGAT 120 

GCAACTAACA CCATCTGTTG CCTTTCTCTC GAGAAAGCTC CTCCACTTCT CACACTAGAT 180 

TTATCCGTTC CTTGTCGACT TCCCGTCCCA TTCGGCCTCG TCCACTGAAG ATCTATCCCA 240 

CCATTGCACG TGGGCCACCT TTGTGAGCTT CTAACCTGAA CTGGTAGAGT ATCACACAAC 300 

ATGCGAAAGT GGGATGAAGG GGTTATATGA GGACCGTCCG GTCCGGCGCG ATGGCCGTAG 360 

CTGCCAATCG CTGCTGTGCA AGAAATTTCT TCTCATAGGC ATC ATG GGC GTC TCT 415 

Met Gly Val Ser 



GCT GTT CTA CTT CCT TTG TAT CTC CTA GCT GG GTATGCTAAG 457 
Ala Val Leu Leu Pro Leu Tyr Leu Leu Ala Gly 
-15 - 10 -5 

CACCGCTATC tAAGTCTGAT AAGGACCCTC TTTGCCGAGG GCCCCTGAAG CTCGGACTGT 517 

GTGGGACTAC TGATCGCTGA CAATCTGTGC AG A GTC ACC TCC GGA CTG GCA 568 

Val Thr Ser Gly Leu Ala 
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TAT CCG ACC GAG TCC AAG GGC AAG AAA TAC TCC GCT CTC ATT GAG GAG 808 

Tyr Pro Thr Glu Ser Lys Gly Lys Lys tyr Ser Ala Leu He Glu Glu 
70 75 80 

ATC CAG CAG AAC GTG ACC ACC TTT GAT GGA AAA TAT GCC TTC CTG AAG 856 
He Gin Gin Asn Vol Thr Thr Phe Asp Gly Lys Tyr Ala Phe Leu Lys 
85 90 95 

ACA TAC AAC TAC AGC TTG GGT GCA GAT GAC CTG ACT CCC TTC GGA GAG 904 
Thr Tyr Asn Tyr Ser Leu Gly Ala Asp Asp Leu Thr Pro Phe Gly Glu 
100 105 no 

CAG GAG CTA GTC AAC TCC GGC ATC AAG TTC TAC CAG CGA TAC GM TCG 952 
Gin Glu Leu Val Asn Ser Gly lie Lys Phe Tyr Gin Arg Tyr Glu Ser 
115 120 125 130 

CTC ACA AGG AAC ATC ATT CCG TTC ATC CGA TCC TCT GGC TCC AGC CGC 1000 
Leu Thr Arg Asn He He Pro Phe Lie Arg Ser Ser Gly Ser Ser Arg 
135 140 145 

GTG ATC GCC TCC GGC GAG AAA TTC An GAG GGC TTC CAG AGC ACC AAG 1048 
Val He Ala Ser Gly Glu Lys Phe He Glu Gly Phe Gin Ser Thr Lys 
150 155 160 

CTG AAG GAT CCT CGT GCC CAG CCG GGC CAA TCG TCG CCC AAG ATC GAC 1096 
Leu Lys Asp Pro Arg Ala Gin Pro Gly Gin Ser Ser Pro Lys lie Asp 
165 170 175 

GTG GTC AH TCC GAG GCC AGC TCA TCC AAC AAC ACT CTC GAC CCA GGC 1144 
Val Val He Ser Glu Ala Ser Ser Ser Asn Asn Thr Leu Asp Pro Gly 
180 185 190 

ACC TGC ACT GTC TTT GAA GAC AGC GAA TTG GCC GAT ACC GTC GAA GCC 1192 
Thr Cys Thr Val Phe Glu Asp. Ser Glu Leu Ala Asp Thr Val Glu Ala 
195 200 205 210 

AAT TTC ACC GCC ACG TTC GCC CCC TCC ATT CGT CAA CGT CTG GAG AAC 1240 
Asn Phe Thr Ala Thr Phe Ala Pro Ser He Arg Gin Arg Leu Glu Asn 
215 220 225 

GAC CTG TCT GGC GTG ACT CTC ACA GAC ACA GAA GTG ACC TAC CTC ATG 1288 
Asp Leu Ser Gly Val Thr Leu Thr Asp Thr Glu Val Thr Tyr Leu Mei 
230 235 240 



FIG.5B 
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GAC ATG TGC TCC TTC GAC ACC ATC TCC ACC AGC ACC GTC GAC ACC AAG 1336 
Asp Met Cys Ser Phe Asp Thr He Ser Thr Ser Thr Vol Asp Thr Lys 
245 250 255 

CTG TCC CCC TTC TGT GAC CTG TTC ACC CAT GAC GAA TGG ATC CAC TAC 1384 
Leu Ser Pro Phe Cys Asp Leu Phe Thr His Asp Glu Trp lie His Tyr 
260 265 270 

GAC TAC CTC CAG TCC CTG AAA AAA TAC TAC GGC CAT GGC GCA GGT AAC 1432 
Asp Tyr Leu Gin Ser Leu Lys Lys Tyr Tyr Gly His Gly Ala Gly Asn 
275 280 285 290 

CCG CTC GGC CCG ACC CAG GGC GTC GGC TAC GCT AAC GAG CTC ATC GCC 1480 
Pro Leu Gly Pro Thr Gin Gly Val Gly Tyr Ala Asn Glu Leu He Ala 
295 300 305 

CGT CTC ACC CAC TCG CCT GTC CAC GAT GAC ACC AGC TCC AAC CAC~ACC 1528 
Arg Leu Thr His Ser Pro Val His Asp Asp Thr Ser Ser Asn His Thr 
310 315 320 

TTG GAC TCG AAC CCA GCT ACC TTC CCG CTC AAC TCT ACT CTC TAC GCG 1576 
Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn Ser Thr Leu Tyr Ala 
325 330 335 

GAC TTT TCC CAC GAT AAC GGC ATC ATC TCT ATC CTC TTT GCT TTG GGT 1624 
Asp Phe Ser His Asp Asn Gly lie He Ser He Leu Phe Ala Leu Gly 
340 345 350 



CTG TAC AAC GGC ACT AAG CCG CTG TCT ACC ACG ACC GTG GAG AAT ATC 1672 
Leu Tyr Asn Gly Thr Lys Pro Leu Ser Thr Thr Thr Val Glu Asn He 
355 360 365 370 

ACC CAG ACA GAT GGG TTC TCG TCT GCT TGG ACG GTT CCG TTT GCT TCG 1720 
Thr Gin Thr Asp Gly Phe Ser Ser Ala Trp' Thr Val Pro Phe Ala Ser 
375 380 385 

CGT CTG TAC GTC GAG ATG ATG CAG TGC CAG GCC GAG CAG GAG CCG CTG 1768 
Arg Leu Tyr Val Glu Met Met Gin Cys Gin Ala Glu Gin Glu Pro Leu 
390 395 400 

GTC CGT GTC TTG GTT AAT GAT CGC GTT GTC CCG CTG CAT GGG TGT CCA 1816 
Val Arg Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys Pro 
405 410 415 

FIG.5C 
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ATT GAT GCT TTG GGG AGA TGT ACC CGG GAT AGC TTT GTG AGG GGG TTG 1864 
He Asp Ala Leu Gly Arg Cys Thr Arg Asp Ser Phe Val Arg Gly Leu 
420 425 430 

AGC TTT GCT AGA TCT GGG GGT GAT TGG GCG GAG TGT TCT GCT 1306 
Ser Phe Ala Arg Ser Gly Gly Asp Trp Ala Glu Cys Ser Ala 
435 440 445 

TAGCTGAACT ACCTTGATGG ATGGTATGTA TCAATCAGAG TACATATCAT TACTTCATGT 1966 

ATGTATTTAC GAAGATGTAC ATATCGAAAT ATCGATGATG ACTACTCCGG TAGATATTTG 2026 

GTCCCCTTCT ATCCTTCGTT CCACAACCAT CGCACTCGAC GTACAGCATA ATACAACTTC 2086 

AGCATTAACA AACGAACAAA TAATATTATA CACTCCTCCC CAATGCAATA ACAACCGCAA 2146 

TTCATACCTC ATATAGATAC AATACAATAC ATCCATCCCT ACCCTCAAGT CCACCCATCC 2206 

CATAATCAAA TCCCTACTTA CTCCTCCCCC TTCCCAGAAC CCACCCCCGA AGGAGTAATA 2266 

GTAGTAGTAG AAGAAGCAGA CGACCTCTCC ACCAACCTCT TCGGCCTCTT ATCCCCATAC 2326 

GCTATACACA CACGAACACA CCAAATAGTC AGCATGC 23&3 

FIG.5D 
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pALK171 FUSION 

5' -PRIMER (39-mer) 

3 - C AAC CGC GGA CTG CGC ATC ATG GGC GTC TCT GCT GTT CT 
Sad I 

19 nts "tail" of cbhl 20 nts OF THE PHYTASE SIGNAL 

(PROMOTER SEQUENCE) SEQUENCE 

3' -PRIMER (22-mer) 
5' - A TTT QIC GAG GCG GGG ACT GCC 
Xhd 

PHYTASE SEQUENCE 
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5' -PRIMER (46-mer) 

5'- CTC GGC CTT CTT GGC CAC AGC TCG TGCT CTG GCA GTC CCC GCC TCG 

Sf II 

28 nts "tail" of cbhl 18 nis OF THE PHYTASE 

(SIGNAL SEQUENCE) N-TERMINAL SEQUENCE 

3' -PRIMER (24-mer) 

5'- TTG GT G TCG AC G GTG CTG GTG GAG 
Sail 

PHYTASE SEQUENCE 
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pALI<532 FUSION 
5'- PRIMER (46-mer) 

5'- CTC GGC CTT CTT GGC C AC AGC TCG TGC TTT CTC CTA CGG CGC 

TGC C SfU 

28-nts tail of cbhl 18nts OF THE ACID PHOSPHATASE 

(SIGNAL SEQUENCE) N-TERMINAL SEQUENCE 



3'- PRIMER (30-mer) 

5'- GCC ATG GTT G TA CGC GT C CAG CAA ACC GGC 

Mlul 

pH 2.5 ACID PHOSPHATASE SEQUENCE 



pALK533 FUSION 
5'- PRIMER (39-mer) 

5'- CAA CCG CGG ACT GCG CAT CAT GCC TCG CAC CTC TCT CCT 
SacII 

19nts tail of cbhl 20nts OF THE ACID PHOSPHATASE 

(PROMOTER SEQUENCE] SIGNAL SEQUENCE 



3'- PRIMER (30-mer) 

5'- GCC ATG GTT G TA CGC. GT C CAG CAA ACC GGC 

Mlul 

pH2.5 ACID PHOSPHATASE SEQUENCE 
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